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REMARKS 

Claims 1-5 remain present for examination. Applicants respond below to the specific 
rejections raised by the Examiner in the final Office Action mailed December 7, 2004. For the 
reasons set forth below. Applicants respectfiiUy traverse. 

Rejection under 35 V.S.C. SlOl - Utility 

The Examiner has maintained the rejection of the pending claims under 35 U.S.C. § 101 
as lacking patentable utility. The Examiner alleges that the specification provides data showing 
a small increase in DNA copy number, about 2-fold, in some normal tissues and cancer, 
however, there is no evidence regarding whether or not PR01327 mRNA or polypeptide levels 
are also increased in these cancers. The Examiner cites Pennica et al. in support of her position 
that there is often a lack of correlation between DNA amplification and increased peptide levels. 
The Examiner states that Haynes et al. showed that polypeptide levels cannot be accurately 
predicted fi-om mRNA levels, and that Hu et al. found that for genes displaying a 5-fold change 
or less in tumors compared to normal, there was no evidence of a correlation between altered 
gene expression and a known role in the disease. The Examiner concludes that, given the small 
increase in DNA copy number of PRO 1327, and the evidence provided by the current literature, 
it is clear that one skilled in the art would not assume that a small increase in gene copy number 
would correlate with significantly increased mRNA or polypeptide levels. Further research is 
required, such that the asserted utility is not yet in currently available form. 

Applicants respectfully submit that the Examiner has misunderstood the data presented in 
the specification, and that the claimed antibodies have utility in the field of cancer diagnostics 
and therapeutics. 

Utility need NOT be proven to an Absolute Certainty - a Correlation between the Evidence and 

the Asserted Utility is Sufficient 

Compliance with 35 U.S.C. § 101 is a question of fact. Raytheon v. Roper, 724 F.2d 951, 

956, 220 USPQ 592, 596 (Fed. Cir. 1983) cert, denied, 469 US 835 (1984). The evidentiary 

standard to be used throughout ex parte examination in setting forth a rejection is a 

preponderance of the evidence, or "more likely than not" standard. In re Oetiker, 977 F.2d 1443, 

1445, 24 USPQ2d 1443, 1444 (Fed. Cir. 1992). This is stated explicitly in the M.P.E.P.: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
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provide evidence such that it establishes an asserted utiUty as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a whole, 
it leads a person of ordinary skill in the art to conclude that the asserted utility is 
more likely than not . M.P.E.P. at § 2107.02, part VII (2004) (emphasis in 
original, intemal citations omitted). 

The PTO has the initial burden to offer evidence "that one of ordinary skill in the art 
would reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 
1436 (Fed. Cir. 1995). Only then does the burden shift to the Applicant to provide rebuttal 
evidence. Id. As stated in the M.P.E.P., such rebuttal evidence does not need to absolutely 
prove that the asserted utility is real. Rather, the evidence only needs to be reasonably indicative 
of the asserted utility. 

In Fujikawa v. Wattanasin, 93 F.3d 1559, 39 U.S.P.Q.2d 1895 (Fed. Cir. 1996), the Court 

of Appeals for the Federal Circuit upheld a PTO decision that in vitro testing of a novel 

pharmaceutical compound was sufficient to establish practical utility, stating the following rule: 

[T]esting is often required to establish practical utility. But the test resuhs need 
not absolutely prove that the compound is pharmacologically active. All that is 
required is that the tests be ''reasonably indicative of the desired 
[pharmacological] response." In other words, there must be a sufficient 
correlation between the tests and an asserted pharmacological activity so as to 
convince those skilled in the art, to a reasonable probability, that the novel 
compound will exhibit the asserted pharmacological behavior." Fujikawa v. 
Wattanasin, 93 F.3d 1559, 1564, 39 U.S.P.Q.2d 1895 (Fed. Cir. 1996) (intemal 
citations omitted, bold emphasis added, italics in original). 

While the Fujikawa case was in the context of utility for pharmaceutical compounds, the 
principals stated by the Court are appUcable in the instant case where the asserted utility is for a 
diagnostic use - utility does not have to be established to an absolute certainty, rather, the 
evidence must convince a person of skill in the art "to a reasonable probability." In addition, the 
evidence need not be direct, so long as there is a "sufficient correlation" between the tests 
performed and the asserted utility. 

The Court in Fujikawa relied in part on its decision in Cross v. Ilzuka, 753 F.2d 1040, 
224 U.S.P.Q. 739 (Fed. Cir. 1985). In Cross, the Appellant argued that basic in vitro tests 
conducted in cellular fractions did not establish a practical utility for the claimed compounds. 
Appellant argued that more sophisticated in vitro tests using intact cells, or in vivo tests, were 
necessary to establish a practical utility. The Court in Cross rejected this argument, instead 
favoring the argument of the Appellee: 
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[I]n vitro results... are generally predictive of in vivo test results, i.e., there is a 
reasonable correlation therebetween. Were this not so, the testing procedures of 
the pharmaceutical industry would not be as they are, [Appellee] has not urged, 
and rightly so, that there is an invariable exact correlation between in vitro test 
results and in vivo test results. Rather, [Appellee's] position is that successful in 
vitro testing for a particular pharmacological activity establishes a significant 
probability that in vivo testing for this particular pharmacological activity will be 
successful. Cross v. Ilzuka, 753 F.2d 1040, 1050, 224 U.S.P.Q. 739 (Fed. Cir. 
1 985) (emphasis added). 

The Cross case is very similar to the present case. Like in vitro testing in the 
pharmaceutical industry, those of skill in the field of biotechnology rely on the reasonable 
correlation that exists between gene expression and protein expression (see below). Were there 
no reasonable correlation between the two, the techniques that measure gene levels such as 
microairay analysis, differential display, and quantitative PGR would not be so widely used by 
. those in the art. As in Cross, Applicants here do not argue that there is "an invariable exact 
correlation" between gene expression and protein expression. Instead, Applicants' position 
detailed below is that a measured differential expression of mRNA in cancer cells establishes a 
"significant probability" that the encoded polypeptide will also be differentially expressed in the 
same cancer cells based on "a reasonable correlation therebetween." 

Taken together, the legal standard for demonstrating utility is a relatively low hurdle. An 
Applicant need only provide evidence such that it is more likely than not that a person of skill 
in the art would be convinced, to a reasonable probability, that the asserted utility is true. 
The evidence need not be direct evidence, so long as there is a reasonable correlation between 
the evidence and the asserted utility. The standard is not absolute certainty. 

Even assuming that the PTO has met its initial burden to offer evidence that one of 
ordinary skill in the art would reasonably doubt the truth of the asserted utility, Applicants assert 
that they have met their burden of providing rebuttal evidence such that it is more likely than not 
those skilled in the art, to a reasonable probability, would believe that the PR01327 polypeptide 
and its antibodies are useful as diagnostic tools for cancer. 

PR01327 mRNA has a specific and substantial utility with regard to dia&tosin ^ and treating 

cancer 

Applicants submit that the evidence of record establishes a specific and substantial utility 
for the PRO 1327 polypeptides and antibodies with regard to the diagnosis and treatment of 
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cancer. The Examiner states that "it is important to note that the instant specification provides 
no information regarding increased mRNA levels of PRO 1270 [sic] in tumor samples as 
contrasted to normal tissue samples: Only gene amplification data were presented." Applicants 
respectfully point out that the Examiner is incorrect in her interpretation of the data provided in 
the specification. The data provided in Example 18 of the instant specification shows that cDNA 
encoding PRO 1327 is more highly expressed in normal esophagus, stomach, lung, rectum, and 
skin as compared to tumors in these same tissue types. Because cDNA libraries are prepared by 
isolating mRNA from a particular tissue and converting it to the corresponding cDNA, the 
expression data in Example 1 8 reflect levels of mRNA in the tested tissue types. The differential 
expression of PR01327 polypeptide-encoding mRNAs in esophagus, stomach, lung, rectum, and 
skin tumors relative to normal tissues of the same type render the mRNAs usefiil for diagnosing 
and treating tumors in these tissues. Accordingly, the Examiner's assertions of uncertainty in the 
art regarding whether gene amplification translates to increased protein levels are not relevant to 
whether the claimed polypeptides and antibodies have utility. The differential expression of 
PRO 1327 mRNAs in esophagus, stomach, lung, rectum, and skin tumors relative to normal 
tissues of the same type gives the mRNAs utility in diagnosing and/or treating cancer. A 
correlation between mRNA levels and protein levels, as established below, is sufficient to confer 
a similar utility on the PRO 1327 polypeptides and antibodies. 

Those of skill in the art believe that a correlation more likely than not exists between levels of 
mRNA and corresponding proteins 

AppUcants respectfiiUy submit that a specific and substantial utility for the PRO 1327 
polypeptides and antibodies can be estabUshed by showing that those of skill in the art believe 
that, more likely than not, there is a general correlation between mRNA levels and protein levels, 
as demonstrated below. 

Applicants submit that the evidence of record demonstrates a general correlation between 
levels of mRNA and levels of the corresponding proteins, and that in light of this evidence the 
Examiner has not carried her burden of establishing that one of skill in the art would, more likely 
than not, believe there is no such correlation. As discussed above, evidence of utility does not 
have to be to an absolute certainty, and therefore there does not need to be a necessary 
cormection between mRNA expression and protein expression. Rather, there need only be a 
reasonable correlation between the evidence offered and the asserted utility such that it is more 
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likely than not that a person of skill in the art would be convinced, to a reasonable probability, 
the asserted utility is true. 

Applicants have previously submitted a copy of a Declaration by J. Christopher Grimaldi, 
an expert in the field of cancer biology. As stated in paragraph 5 of the declaration, "Those who 
work in this field are well aware that in the vast majority of cases, when a gene is over- 
expressed. . .the gene product or polypeptide will also be over-expressed. ... This same principal 
• applies to gene under-expression." The declaration further provides, "the detection of increased 
mRNA expression is expected to result in increased polypeptide expression, and the detection of 
decreased mRNA expression is expected to result in decreased polypeptide expression. The 
detection of increased or decreased polypeptide expression can be used for cancer diagnosis and 
treatment." The references cited in the declaration and submitted previously, as well as those 
submitted herewith, support this statement. 

Similarly, Applicants have also submitted the declaration of Paul Polakis, Ph.D., an 
expert in the field of cancer biology, and the principal investigator of the project that led to the 
discovery of the PRO 1327 polypeptides. Dr. Polakis explains that over 200 genes were 
identified as being present in human tumor cells in significantly higher levels than in 
corresponding normal cells, and that for the those mRNAs for which the levels of protein 
expression have been examined, -80% had correspondinglv higher levels of the encoded protein . 
Polakis Declaration at paragraphs 4 and 5. In addition, Dr. Polakis also states in paragraph 6 of 
his declaration: 

Based on my own experience accumulated in more than 20 years of research, 
including the data discussed in paragraphs 4 and 5 above [showing a positive 
correlation between mRNA levels and encoded protein levels in the vast majority 
of cases] and my knowledge of the relevant scientific literature, it is my 
considered scientific opinion that for human genes, an increased level of mRNA 
in a tumor cell relative to a normal cell typically correlates to a similar increase in 
abundance of the encoded protein in the tumor cell relative to the normal cell. In 
fact, it remains a central dogma in molecular biology that increased mRNA levels 
are predictive of corresponding increased levels of the encoded protein. 
(Emphasis added). 

Dr. Polakis acknowledges that there are published cases where such a correlation does not exist, 
but states his opinion that "such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the encoded 
protein." (Polakis Declaration, paragraph 6). 
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The statements of Grimaldi and Polakis are supported by the teachings in Molecular 
Biology of the Cell, a leading textbook in the field (Bruce Alberts, et al. Molecular Biology of 
the Cell (4^'' ed. 2002) submitted herewith as Exhibit 1). Figure 6-3 on page 302 illustrates the 
basic principle that there is a correlation between increased gene expression and increased 
protein expression. The accompanying text states that "a cell can change (or regulate) the 
expression of each of its genes according to the needs of the moment - most obviously by 
controlling the production of its mRNA, " Id. at 302, emphasis added. Similarly, figure 6-90 
on page 364 illustrates the path from gene to protein. The accompanying text states that while 
potentially each step can be regulated by the cell, ' ^the initiation of transcription is the most 
common point for a cell to regulate the expression of each of its genes ." Id. at 364. This 
point is repeated on page 379, where the authors state that of all the possible points for regulating 
protein expression, ' 'Iflor most genes transcriptional controls are paramount ." Id. at 379. 

Additional support is found in Zhigang et al. World Joumal of Surgical Oncology 2:13, 
2004, submitted herewith as Exhibit 2. Zhigang studied the expression of prostate stem cell 
antigen (PSCA) protein and mRNA to validate it as a potential molecular target for diagnosis and 
treatment of human prostate cancer. The data showed "a high degree of correlation between 
PSCA protein and mRNA expression" (see page 6 of Exhibit 2). Of the samples tested, 81 out of 
87 showed a high degree of correlation between mRNA expression and protein expression. The 
authors conclude that "it is demonstrated that PSCA protein and mRNA overexpressed in human 
prostate cancer, and that the increased protein level of PSCA was resulted from the upregulated 
transcription of its mRNA." Exhibit 2, page 11. Even though the correlation between mRNA 
expression and protein expression occurred in 93% of the samples tested, not 100%, the authors 
state that "PSCA may be a promising molecular marker for the clinical prognosis of human Pea 
and a valuable target for diagnosis and therapy of this tumor." Id 

Further, Meric et al.. Molecular Cancer Therapeutics, vol. 1, 971-979 (2002), submitted 
herewith as Exhibit 3, states the following: 

The fundamental principle of molecular therapeutics in cancer is to exploit the 
differences in gene expression between cancer cells and normal cells... [M]ost 
efforts have concentrated on identifying differences in gene expression at the 
level of mRNA, which can be attributable to either DNA amplification or to 
differences in transcription. Meric et al. at 971 (emphasis added). 
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This statement provides additional support for Applicants' asserted utility. It is true that there is 
no necessary correlation between gene expression and protein expression because there are other 
mechanism for regulating gene expression. However, were there no significant correlation 
between gene expression and protein levels, exploiting differences in gene expression between 
cancer cells and normal cells would not be a "fundamental principle of molecular therapeutics in 
cancer." Moreover, as mentioned above, Applicants need not establish a necessary connection 
between gene expression and protein expression. Rather, there need only be a reasonable 
correlation between the evidence offered and the asserted utility such that it is more likely than 
not that a person of skill in the art would be convinced, to a reasonable probability, that the 
asserted utility is true. 

Together, the declarations of Grimaldi and Polakis, the accompanying references, the 
excerpts from "Molecular Biology of the Cell", and the Zhigang et al. and Meric et al. articles 
establish that the accepted understanding in the art is that there is a reasonable correlation 
between gene expression and the level of the encoded protein. Considering that utility does not 
have to be proven to an absolute certainty, AppHcants submit they have provided sufficient 
evidence to show a reasonable correlation between mRNA expression and the level of PRO 1327 
protein. Applicants further submit they have established that it is more likely than not that one of 
skill in the art would believe that, because the PR01327 mRNA is expressed at a lower level in 
esophagus, stomach, lung, rectum, and skin tumors relative to normal tissues of the same type, as 
evidenced by the data set forth in Example 18, the PR01327 polypeptide will also be expressed 
at a lower level in esophagus, stomach, lung, rectum, and skin tumors relative to normal tissues 
of the same type. One of skill in the art would recognize that a protein which is differentially 
expressed in certain cancer cells compared to the corresponding normal tissue, as well as 
antibodies specific for such a protein, would have utiUty as a diagnostic tool. Thus, AppUcants 
submit it is more likely than not that one of skill in the art would recognize the asserted utility of 
the claimed antibodies as a cancer diagnostic tool. 

Even if the Examiner has established a prima facie case o f non-utilitv. Applicants have 
rebutted the Examiner's assertions 

The Examiner cites Pennica et al, Haynes et al., and Hu et al. as evidence that one skilled 
in the art would not assume that a small increase in gene copy number would correlate with 
significantly increased mRNA or polypeptide levels. Based on the cited references, the 
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Examiner concludes that one of skill in the art would not assume a correlation between gene 
copy number and protein levels exists, but rather would perform experiments to verify it. 
Applicants again point out that the expression data in Example 18 reflect levels of mRNA in the 
tested tissue types, and address each of the references in tum. 
Pennica et al. 

The Examiner also cites Pennica et al. in support of the assertion that it is not the norm 
that DNA amplification correlates with corresponding protein levels. Pennica found that the 
WISP-2 gene DNA was amplified in colon tumors, but its mRNA expression was significantly 
reduced in the majority of tumors relative to normal colonic mucosa from the same patient. The 
evidence in Pennica of a lack of correlation between DNA copy number and mRNA levels says 
nothing about a lack of correlation between the level of mRNA and the level of protein 
expression - in fact, Pennica did not even examine protein expression. It is the correlation 
between mRNA level and the level of protein expression which is at issue here, not the 
correlation of gene copy number and mRNA levels. The data Applicants report in Example 18 
indicate there are fewer copies of the mRNA encoding PR01327 polypeptides in esophagus, 
stomach, lung, rectum, and skin tumors relative to normal tissues of the same type, and thus 
whether gene amplification leads to increased mRNA is not at issue. Nothing in Pennica is 
contrary to Applicants' assertion that it is well-established in the art that the level of protein is 
positively correlated to the level of mRNA. Applicants respectfully submit that by relying on 
Pennica, the PTO is confiising the relationship between an increase in copy number of a gene 
and increased levels of the corresponding mRNA transcripts on the one hand, with the 
relationship between mRNA expression and levels of the corresponding protein on the other, 
only the latter of which is at issue here. 

Havnes et al 

The Examiner asserts that Haynes et al. (1998, Electrophoresis, 19:1862-1871) found that 
polypeptide levels cannot be accurately predicted fi-om mRNA levels when studying protein 
expression in Saccharomyces cerevisiae. In particular, the Examiner points to the finding that, 
for some genes, equivalent mRNA levels translated into protein abundances which varied by 
more than 50-fold. 

Applicants submit that Haynes does not contradict the utility or enablement of the instant 
claims. Haynes is a review article dealing with the art of proteome analysis. The assertions in 
Haynes cited by the Examiner were made in an effort to identify shortcomings in the art of 
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mRNA quantification to argue for "proteome analysis to become an essential component in the 
comprehensive analysis of biological systems." Haynes, p, 1863. Haynes studied 80 selected 
samples from Saccharomyces cerevisiae, and reported "a general trend but no strong correlation 
between protein and transcript levels (Fig. 1)." Id. However, a cursory inspection of Fig. 1 
shows a clear correlation between the mRNA levels and protein levels measured. This 
correlation is confirmed by an inspection of the full-length research paper from which the data in 
Fig. 1 were derived, presented herein as Exhibit 4 (Gygi et al, Molecular and Cellular Biology, 
Mar. 1999, 1720-1730). Gygi states that "there was a general trend of increased protein levels 
resulting from increased mRNA levels," with a correlation coefficient of 0.935, indicating a 
strong correlation. Gygi, p. 1726. Moreover, Gygi also states that the correlation is especially 
strong for highly expressed mRNAs. Id. Considering that Example 18 of the specification 
shows higher expression of PR01327 mRNA in normal esophagus, stomach, lung, rectum, and 
skin tissue relative to tumors in these same tissue types, Haynes and Gygi actually provide strong 
evidence in support of a general correlation between mRNA and protein levels, and thus the 
utility of the claimed PR01327 polypeptides and antibodies. 

The 50-fold variation referred to by the Haynes and cited by the Examiner, does not in 
any way show the absence of a correlation between mRNA and protein levels, but rather 
identifies the outer limits of variabiUty in the authors* experiments. This variabiUty may support 
the authors* assertion that the amount of a particular protein cannot accurately predict the 
particular level of the corresponding mRNA transcript, but it does not suggest an absence of a 
general correlation between mRNA and protein levels. Again, Applicants' utility is based on the 
differential expression of mRNA in esophagus, stomach, lung, rectum, and skin tumors relative 
to normal tissues of the same type. Exact levels of expression are irrelevant. Moreover, Gygi 
states that the high degree of variability seen at low levels of mRNA (shown in inset of Fig. 1, 
Haynes p. 1863) is due to the fact that "the magnitude of the error in the measurement of mRNA 
levels is inversely proportional to the mRNA levels." Gygi, p. 1727. Considering that PR01327 
mRNA has been shown in Example 18 of the specification to be more highly expressed in 
normal esophagus, stomach, lung, rectum, and skin tissue than in tumors in these same tissue 
types, the variability identified by Haynes is even less applicable to establishing the absence of a 
correlation between mRNA and protein levels in the instant case. 

As stated above, the standard for utility is not absolute certainty, but rather whether one 
of skill in the art would be more likely than not to believe the asserted utility. Here, the utihty of 
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PRO 1327 polypeptides and antibodies as diagnostic tools does not require Applicants to show 
that mRNA levels correlate to protein levels in every case, but rather only that the correlation 
exists more often than not. The data presented in Haynes is not inconsistent with or 
contradictory to the utility or enablement of the instant claims. To the contrary, the data clearly 
show a general correlation between protein levels and mRNA levels, and thus support 
AppHcants' assertion that such a general correlation exists. 

Even if Haynes supported the Examiner's argument, which it does not, one contrary 
example does not establish that one of skill in the art would find it is more likely than not there is 
no general correlation between mRNA level and protein levels. Li fact, the working hypothesis 
among those skilled in the art, as illustrated by the evidence presented above by Applicants, is 
that there is a direct correlation between mRNA levels and protein levels. This is further 
supported by the statement in Haynes that "interpretations of quantitative mRNA expression 
profiles frequently implicitly or explicitly assume that for specific genes the transcript levels are 
indicative of the levels of protein expression." See, Haynes, p. 1863, first full paragraph. 
Haynes does not suggest there is no correlation between mRNA and protein levels, but rather 
points to what the authors beUeve are shortcomings of using mRNA quantification to predict 
protein levels; specifically, that mRNA levels may not accurately predict protein levels in each 
particular instance. Considering the more likely than not standard for utility, Haynes' 
identification of reasons why proteomic analysis may be preferable in some cases does not 
contradict Applicants' evidence that there is a general correlation between mRNA and protein 
levels. 

Hu et al 

The Examiner states that Hu et al. analyzed 2286 genes that showed a greater than 1-fold 
difference in mean expression level between breast cancer samples and normal samples in a 
microarray. Hu discovered that for genes displaying a 5-fold change or less in tumors compared 
to normal, there was no evidence of a correlation between ahered gene expression and a known 
role in the disease. However, among genes with a 10-fold or more change in expression level, 
there was a strong and significant correlation between expression level and a published role in 
the disease. 

In Hu, the researchers used an automated literature-mining tool to summarize and 
estimate the relative strengths of all human gene-disease relationships published on Medline. 
They then generated a microarray expression dataset comparing breast cancer and normal breast 
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tissue. Using their data-mining tool, they looked for a correlation between the strength of the 
literature association between the gene and breast cancer, and the magnitude of the difference in 
expression level. They report that for genes displaying a 5-fold change or less in tumors 
compared to normal, there was no evidence of a correlation between altered gene expression and 
a known role in the disease. See Hu at 411. However, among genes with a 10-fold or more 
change in expression level, there was a strong correlation between expression level and a 
published role in the disease. Id, at 412. Importantly, Hu reports that the observed correlation 
was only found among estrogen receptor-positive tumors, not ER-negative tumors. Id. 

The general findings of Hu are not surprising - one would expect that genes that have the 
greatest change in expression in a disease would be the first targets of research, and therefore 
have the strongest known relationship to the disease as measured by the number of reports of a 
connection in the literature. But this does not mean that genes, and their corresponding proteins, 
with a lower level of change in expression are not important or cannot be used as molecular 
markers of the disease. This is demonstrated by the fact that ER-negative tumors did not show a 
correlation. The correlation reported in Hu only indicates that the greater the change in 
expression level, the more likely it is that there is a published or known role for the gene in the 
disease, as found by their automated literature-mining software. Nowhere in Hu does it say that 
a lack of correlation in their study means that the genes, and their corresponding proteins, with a 
less than five-fold change in level of expression in cancer cannot serve as a molecular marker of 
cancer. Genes with lower levels of change in expression may or may not be the most important 
genes in causing the disease, but the genes and their corresponding proteins can still show a 
consistent and measurable change in expression. While such genes and polypeptides may or 
may not be good targets for fiirther research, they can nonetheless be used as diagnostic tools. 
Thus, Hu does not refute the Applicants' assertion that the PRO 1327 gene, and its corresponding 
polypeptide and antibodies, can be used as a cancer diagnostic tool because they are 
differentially expressed in certain tumors. 

Thus, the evidence presented by Applicants, including the declarations of Grimaldi and 
Polakis, the accompanying references, the excerpts from "Molecular Biology of the Cell", and 
the Zhigang et al. and Meric et al. articles estabUsh that the accepted understanding in the art is 
that there is a reasonable correlation between gene expression and the level of the encoded 
protein. Further, the references cited by the Examiner as evidence that no such general 
understanding exists do not in any way support the Examiner's argimients. To the contrary, 
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Haynes, Pennica, and Hu each provide strong evidence of the correlation asserted by AppUcants. 
Accordingly, Applicants respectfully submit that the totality of the evidence clearly supports the 
conclusion that one of skill in the art understands that, more likely than not, levels of mRNA 
directly correlate with levels of corresponding proteins. 

Considering that evidence of utility does not have to be to an absolute certainty, 
Applicants submit they have provided sufficient evidence to show a reasonable correlation 
between PR01327 mRNA levels and levels of PR01327 polypeptides. Further, because the 
PR01327 mRNA is expressed at a higher level in normal esophagus, stomach, lung, rectum, and 
skin tissue relative to tumors in these same tissue types, Applicants submit it is more likely than 
not that the PRO 1327 polypeptide will also be expressed at a higher level in normal esophagus, 
stomach, lung, rectum, and skin tissue relative to tumors in these same tissue types. One of skill 
in the art would recognize that a protein which is differentially expressed in certain cancer cells 
compared to the corresponding normal tissue, as well as antibodies specific for such a protein, 
would have utility as diagnostic and therapeutic tools. Thus, Applicants submit that it is more 
likely than not that one of skill in the art would recognize the asserted utility of the PRO 1327 
polypeptides and antibodies as cancer diagnostic and therapeutic tools. Accordingly, Applicants 
respectfully request withdrawal of the rejections of the pending claims under §101. 

Rejection under 35 U,S.C> S112 - Enablement 

The PTO has maintained its rejection of Claims 1-5 under 35 U.S.C. §112, first 
paragraph. The PTO states that since the claimed invention is not supported by either a specific 
asserted utility or a well established utility, one skilled in the art would not know how to use the 
claimed invention. 

Applicants believe that the evidence, declarations, references, and arguments discussed 
above make clear that Applicants have established that one of skill in the art would be 
convinced, to a reasonable probability, that it is more likely than not that PRO 1327 proteins are 
underexpressed in esophagus, stomach, lung, rectum, and skin tumors relative to normal tissues 
of the same type, and therefore have utility as a diagnostic tool for detecting tumors. This would 
include the use of the PRO 1327 polypeptides to create diagnostic and therapeutic antibodies. 
This use is disclosed in the application, and the techniques for the creation of antibodies are well 
known and routine in the art. Thus, at least one use of PR01327 polypeptides and antibodies is 
adequately enabled, which is all that is required - "if any use is enabled when multiple uses are 
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disclosed, the application is enabling for the claimed invention." M.P.E.P. 2164.01(c). In view 
of the above, Applicants respectfully request that the Examiner reconsider and withdraw the 
enablement rejection under 35 U.S.C, § 1 12, first paragraph. 

The Examiner states that Applicants "point to the results of the amplification assay which 
showed an approximate 2-fold amplification of the PR01327 DNA in some cancers and some 
normal tissues as compared to lung cancers and colon cancers." The Examiner believes that 
"one skilled in the art would not assume that a small increase in gene copy number, in both 
normal tissues and cancer tissues, would correlate with significantly increased mRNA or 
polypeptide levels." Applicants again point out to the Examiner that the data in Example 18 is 
not gene ampUfication data. Rather, the expression data in Example 18 reflect levels of mRNA 
in the tested tissue types. The differential expression of PR01268 polypeptide-encoding 
mRNAs in tumor tissues relative to normal tissues of the same type render the mRNA usefiil for 
diagnosing and treating tumors, specifically, esophagus, stomach, lung, rectum, and skin tumors. 

Applicants respectfiiUy submit that the great weight of the evidence supports the utility 
and enablement of the claimed antibodies and polypeptides. Applicants have provided numerous 
examples demonstrating a general understanding in the art that protein levels are regulated 
primarily by regulating mRNA levels in the large majority of cases, including the statements in 
Alberts, a leading textbook in the field of Molecular Biology, and the declarations of Dr. Polakis 
and Dr. Grimaldi, both experts in the field of Cancer Biology with numerous years of experience. 
Of particular significance is the fact that these references have identified the general 
understanding in the field, as opposed to isolated examples. In addition, the experiments 
testified to by Dr. Polakis as well as those in Haynes show a correlation between mRNA and 
protein levels for a large number of different genes. These references are in addition to the 
nimierous examples of particular genes shown by Applicants, including those in Example 18 of 
the specification, and in Zhigang, Meric, and Pennica. Applicants respectfiiUy submit that the 
totality of the above-cited evidence clearly establishes that those of skill in the art would believe 
that mRNA levels more likely than not correlate with protein levels. In light of the fact that 
Applicants need not show a necessary correlation between mRNA and protein levels, Applicants 
respectfiiUy submit that they have rebutted any prima facie case of non-utility and non- 
enablement the Examiner may have established. In fact, the "more likely than not" standard 
would effectively be an "absolute certainty" standard if the Examiner's few instances of post- 
transcriptional regulation were found to establish the non-existence of a general correlation 
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between mRNA and protein levels in light of the totality of evidence produced above by 
Applicants. Accordingly, AppUcants request withdrawal of the rejection of the pending claims 
under35U,S.C.§112. 

Continuity 

The Examiner states that since the claimed invention does not have utility, the provisional 
patent applications listed, although disclosing the same experimental assays as the instant 
specification, do not impart utility to the instant invention. Therefore, the filing date of May 3, 
2002 in considered the priority date. 

AppUcants have previously listed the priority information for the instant application in a 
Preliminary Amendment mailed September 3, 2002. The preliminary amendment states that the 
instant appUcation is a continuation of, and claims priority under 35 U.S.C. § 120 to, US 
Application 10/006867 filed 12/6/2001, which is a continuation of, and claims priority under 35 
U.S.C. § 120 to, PCT Application PCT/USOO/23328 filed 8/24/2000, which is a continuation-in- 
part of, and claims priority under 35 U.S.C. § 120 to US AppHcation 09/403297 filed 
10/18/1999, now abandoned, which is the National Stage filed under 35 USC § 371 of PCT 
Application PCT/US99/20111 filed 9/1/1999, which claims priority under 35 USC § 119 to US 
Provisional Application 60/100684 filed 9/17/1998. 

Applicants submit that for the reasons stated above, the claimed antibodies have a 
credible, substantial, and specific utility. The sequence of SEQ ID NO:92 was first disclosed in 
US Provisional AppUcation 60/100684 filed September 17, 1998 in Figure 2 and SEQ ID N0:2. 
The data in Example 18 (Tumor Versus Normal Differential Tissue Expression Distribution) 
were first disclosed in PCT AppUcation PCT/USOO/23328 filed August 24, 2000, on page 93, 
line 3, through page 96, line 35. Thus, Applicants submit they are entitled to a priority date of at 
least August 24, 2000, if not the earUer date of September 17, 1998. 

CONCLUSION 

In view of the above, Applicants respectfully maintain that the claims are patentable and 
request that they be passed to issue. Applicants invite the Examiner to call the undersigned if 
any remaining issues may be resolved by telephone. 
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Figure 6-3 Genes can be expressett 
with different efficiencies. Gene A Is 
transcribed and translated much more 
efllclentlX than gene B.This allows the 
amount of protein A in the cell be 
much greater than that of protein B. 
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TRANSLATION 



FROM DNA TO RNA 

IVanscriptioa and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made ficom the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rq)idly when necessary. But each gene can also be transcribed and 
translated with a different efficiency, allowing the cell to make vast quantities of 
some proteins and tiny quantities of otiiers (Figure 6-3). Moreover, as we see in 
the next diapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment— most obviously by controlling 
the production of its RNA 



Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence— a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA— 
the language of a nucleotide sequence. Hence the name transcription. 

like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds (Figure 6-4). It differs from 
DNA chemically in two' respects: (1) the nucleotides in RNA are 
riboniicleotides--4h2it is, they contain the sugar ribose (hence the name ribonu- 
cleic acid) rather than deo^Qrribose; (2) altixough, like DNA, RNA contains the 
bases adenine (A), guanine (G), and cytosine (C), it contains the base uracfl (U) 
instead of tiie thymine (T) in DNA- Since U, like X can base-pair by hydrogen- 
bondmg with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chgqpters 4 and 6 apply also to RNA (in RNA, G pairs with 
C, and A pairs with U), It is not imcommon, however, to find other types of base 
pairs in RNA: for example, G pairing with U occasionally. 

Despite these smaJl chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA alwa^ occurs in cells as a double- 
stranded helix, RNA is sin^e-stranded. RNA chains tiierefore fold up hito a 
variety of shapes, just as a polypeptide chahi folds up to form the final shape of 
a protein (Figure 6-6). As we see later in this chapter, the abUity to fold into com- 
plex three-dhnensional shapes allows some RNA molecules to have stmctural 
and catalytic functions. 



Transcription Produces RNA Complementary to 
One Strand of DNA 

AH of tiiie RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 
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Figure 6-89 Protein aggregates that cause human disease. {A) Schematic Illustration of the type of 
conformational change In a protein that produces niaterial for a cross-beta filament (B) Diagram Illustrating 
the self-Infectious nature of the protein aggregation that Is central to prion diseases. PrP Is highly unusual 
because the misfolded version of the prot«n, called PrP*, induces the nbrmal PrP protein it contacts- to 
change Its confomiation, as shown. Most of the human diseases caused by protein aggregation are caused by 
the overproduction of a variant protein that is especially prone to aggregation, but because this struaure Is 
not Infectious in this way. It cannot spread from one animal to another. (C) Dravrtng of a cross-beta filament, 
a common type of protease-resistant protein aggregate found in a variety of human neurological diseases. 
Because tiie hydrogen-bond interactions in a P sheet form between polypeptide backbone atoms (see Figure 
3-9), a number of different abnormally folded proteins can produce this structure. (D) One of several 
possible models for the conversion of PrP to PrP*. showing the likely change of two a-helices Into four 
p-sQrands. Although the structure of the normal protein has been determined accurately, the structure of the 
infectious form is not yet known wfth certainty because the aggregation has prevented the use of standard 
structural techniques. (C, courtesy of Louise Serpell, adapted from M. Sunde ct al.,J. Mo/. B/ot 273:729-739. 
l997;D.adaptedfromS.&Pnistner,1f«n(fcBrocftem.Scf.2i:4^ 1996.) 

animals and humans. It can be dangerous to eat the tissues of animals that con- 
tain PrP*, as witnessed most recently by the spread of BSE (commonly referred 
to as the '^nad cow disease'^) from catde to humans in Great Britain. 

Foitimately; in the absence of PrP*» PrP is extraordinarily difiBcult to convert 
to its abnormal form. Although very few proteins have the potential to misfold 
into an infectious conformationi a similar transformation has been discovered 
to be the cause of an otherwise mysterious **protein-only inheritance** observed 
in yeast cells. 

There Are Many Steps From DNA to Protein 

We have seen so far in this chapter that many different types of chemical reac- 
tions are required to produce a propeiiy folded protein from the information 
contained in a gene (Figure 6-90). The final level of a properly folded protein in 
a cell therefore depends upon the efficiency with which each of the many steps 
ispeifiarmed, 

We discuss in Chapter 7 that cells have the ability to change the levels of 
their proteins according to their needs, la principle, any or all of the steps in Fig- 



FROMRNATDPROTBN 



363 




lAAAA mRNA 
DCPOKT poly-AtafI 



NUCLEUS 



CYTOSOl 



lAAAA mRNA 



T IN 



mRNA DEGRADATION 



INrmilON OF PROTEIN SYNTHESIS 



COMPLETION OFPROTEIN SYNTHESIS 
AND PROTEIN FOLDING • 

H,N 
COOH 

^ PROTEIN DEGRADATION 
O^COOH 



Figure 6-90 The production of a 
protein by a eucaryotic cell* The final 
level of each protein in a eucaryotic cell 
depends upon the efTidency of each step 
depicted. 



lure 6-90) could be regulated by the cell for each individual protem. However, as 
we shall see in Oiapter 7, the initiation of transcription is the most common 
point for a cell to regulate die expression of each of its genes. Iliis makes sense, 
inasmudi as the most efEldent way to keep a gene from being expressed is to 
block the very first step— the transcription of its DNA sequence into an RNA 
molecule. 

Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a large ribonudeoprotdn assembty catted a ribosome. The 
amino acids used for protein synthesis are first attached to a family of tRNA 
molecules, each of which recognizes, by complementary base-pair interactions, par- 
ticular sets of three nucleotides in the mRNA (codons). The sequerwe ofnucleotldes in 
the mRNA is then read from one end to the other in sets of three according to the 
genetic code. 

To initiate translation^ a small ribosomal subunit binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule, A 
large ribosomal subunit binds to complete Ae ribosome and begbt the elongation 
phase of protein synthesis* Durir^ this phase, aniinoacyl tRNAs--each bearing a 
specific amino acid bind sequentially to the appropriate codon in mRNA by farming 
complementary base pairs wtOt the tRNA anOcodon. Bach amino add is added to the 
C-tenninal end of ^growing polypeptide by means of a cycle of three sequential 
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Figure 7^ Sfoc steps at which 
eucaryofic gene expression can be 
controlled. Controls that operate at 
steps I through S are discussed io.this 
chapter. Step 6, the regulation of protein 
activity. Includes reversible activation or 
Inactlvatlon by protein phosphorylation' 
(discussed in Chapter 3) as well as 
irreversible inactivatSon proteolytic 
d^radation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

If differences among the various ceU types of an organism depend on the partic- 
ular genes that the ceUs express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, there are many steps in the patiiway 
leading from DNA to protein, and all of them can m princ^le be legulateA Thus 
a ceU can control the proteins it makes by (1) controlling v*en and how often a 
givra gfflie Is transcribed (traiiscriptionalcontrol}, (2) controlling how the RNA 
transcnpt is spliced or otherwise processed (RNA processing control), (3) 
selecting which completed mBNAs in the ceU nucleus are exported to the cytosol 
and detennmmg where in the cytosol they are localized (RNA transport and 
localization control), (4) selecting whidi mBNAs in the cytoplasm are translated 
by nbosomes (translational control), (5) selectively destabilizing certain mRNA 
molecules in the cytoplasm (mRNA degradation control), or (6) selectively acti- 
vating, mactivating, degrading, or compartmentalizing specific protein 
molecules after theyhave been made (prot^activi^r control) (J5gure7-5). 

For most genes transcriptional controls are paramount. Ihis makes sense 
because, of all fhe possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures that the ceU will not synthesize superfluous Intetme- 
drates. In the foUowmg sections we discuss tiie DNA and protem components 
that perform this fimction by regulating fee hiitiation of gene transcription. We 
shall return at the end of die diapter to die additional ways of regulating gene 
e:q>re8sion. 



Summary 

The genome ofa ceU contains in Us DNA sequence the information to make many 
thousands of different protein and JBVA molecules. A cett typicaUy ej^nsses only a 
fraction ofits genes, and the diff^ttypes of cells in muUloOtularoreimtsmsar^ 
bwmse different sets of genes an expressed. Moreover, ceOs can Oumge the pattern 
^ genes Oi^ express in response to charges in their environment such as signals 
from oOter cells. Althou^ all of the steps involved in expressing a gene can in prin- 
^Ple be regulated, for most genes the initiation of RNA tmnsaiption Is the most 
.important point of connoL 



DMA-BINDING MOTIFS IN GENE REGULATORY 
PROTEINS 

How does a cell detemdne w^ch of its thousands of genes to transcribe? As 
; mentioned briefly in Chapters 4 and 6, the transcription of each gene is con- 
ttouedbyaiegulatory region of DNA relatively near the site where transcription 
»^ h! . regulatory regions are simple and act as switches tiiat are thrown 
|oy a single signal Many others are complex and act as tiny microprocessors, 
|r^ondlngto avariety of signak feat they Interpret and Integrate to switch the 
^neighboring gene on or oft Whether complex orslmple, these switcbing devices 
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occur In the genu line, the ceU llnedge that gives rise to sperm or eggs. Most of 
the DNA in vertebrate germ ceUs is inactive and highly methylated. Over long 
periods of evolutionary time, the methylated CG sequences in these inactive 
regions have presumably been lost through spontaneous deamination events 
that were not properly repaired. However promoters oif genes that remain active 
in the germ ceU lineages " (mcluding most housekeepmg genes) are kept 
immethyiated, and therefore spontaneous deaminations of Cs that occux with- 
in them can be accurately repaired Such regions are preserved in modem day 
vertebrate cells as CG islands. In addition, any mutation of a CG sequence in the ' 
genome that destroyed the function or regulation of a gene in the adult would be 
selected against, and some CG islands are simply the result of a higher than nor- 
mal density of critical CG sequences. 

The mammalian genome contains an estimated 20,000 CG islands. Most of 
the islands marie the 5' ends of transcription units and thus, presumably, of 
genes. The presence of CG islands often provides a convenient way of identify- 
ing genes in the DNA sequences of vertebrate genomes. 

Summary 

TTie many types of cells in animals and plants are created largely fhrough mecha- 
nisms that cause different genes to be transcribed in different cells. Since many 
specialized animal cells can maintain their unique character through many ceU 
division cycles and even when grown in culture, the gene r^ulatory mechanisms 
involved in creaHng them must be stable once established and heritable when the 
ceU divides. These features endow the cell tvith a memory of its developmental history. 
Bacteria and yeasts provide unusually accessible model systems in which to study 
gene regulatory mechanisms. One such mechanism involves a compeHtive interac- 
tion between two gene regulatory proteins, each of which inhibits the synthesis of the 
other; this can create a flip-flop switch that switches a cell between two alternative 
patterns of gene expression. Direct or indirect positivefeiedback loops, which enable 
gene regulatory proteins to perpetuate (heir own synthesis, provide a general mech- 
anismfor cell memory. Negative feedback loops with programmed delays form the 
basis for cellular clocks. 

In eucaryotes the transcription ofagene is generally controlled by combinations 
of gene regulatory proteins. It is thought that each type of cell in a higher eucaryotic 
organism contains a specific combination of gene regulatory proteins that ensures 
the expression of only those genes appropriate to that type of cell A given gene regu- 
latory protein may be active in a variety of circumstances and typically is involved 
in the regulation of many genes, * 

In addition to diffusible gene regulatory proteins, inhented states of chromatin 
condensation are also used by eucaryotic cells to regulate gene expression. An espe- 
dally dramatic case is the inacHvation of an entire X chromosome in female mam- 
mals. In vertebrates DNA methylation also fimcOom in gene regulation, being used 
mainly as a device to reinforce decisions about gene expression that are made inU 
tialfy by other mechanisms. DNA methylation also underlies Ow phenomenon of 
genomic imprinting in mammals, in which the expression of a gene depends on 
whether it was inherited fiom the mother or the father. 
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Figure 7-66 A mechanism to explain 
both the marked overall deficiency 
of CG sequences and their clustering 
into CG islands in .vertebrate 
genomes. A hfack tne marks the location 
of a CG dlnincleotfde In the DIMA 
sequence, while a red^loinpop*' indicates 
the presence of a methyl group on the 
CG dinucfeotlde. CG sequences that lie In 
regulatory sequences of genes that are 
transcribed in germ cells are unmethylated 
and therefore tend to be retained in 
evolution. Methylated CG sequences, on 
the other hand, tend to be lost through 
deamination of 5-methyl C toT. unless the 
CG sequence is critical for survival. 



POSTTRANSCRIPTIONAL CONTROLS 

In principle, every step required for the process of gene e^qpression could be 
controlled Indeed, one can find examples of each type of regulation, although 
any one gene is likely to use only a few of them. Controls on the initiation of 
gene transcription are the predominant form of regulation for most genes. But 
other controls can act later In the pathway from DNA to protein to modulate 
the amount of gene product that is made. Although diese ptettranscriptiLonal 
controls, which operate after RNA polymerase has bound to the gene's promoter 
and begun RNA synthesis, are less common than transcriptional &>ntrol for 
nxany genes they are crucial. 



f^XTTRANSOllPTTONAL CONTROLS 



435 



World'Jbumal of Surgical Oncology | Full text | Prostate stem cell antigen (PSCA) expression in human ... Page 1 of 1 1 



16-NOV-2004 



WORLD JOURNAL OF 
SURGICAL ONCOLOGY 



Home I Browse articles | Search [ Weblinks | Submit article | My WJSO ] About WJSO 

World Journal of Surgical 
Oncology 

Volume 2 

Viewing options: 

• Abstract 

• Full text 

• PDF (778KB) 



BreaM Cancer 

R£S£AI5CH 




Open Afcee^ 



Other links: 

• E-mall to a friend 

• Download references 

• Post a comment 

• PubMed record 

• Related articles in PubMed 

Search PubMed For: 
Zhigang Z 
Wenlv S 

ICey: 

^E-mail 

ffi Corresponding author 



Research 

Prostate stem cell antigen (PSCA) expression in 
human prostate cancer tissues and its potential role 
In prostate carcinogenesis and progression of prostate 
cancer 

Zhao Zhigang^ SS and Shen Wenlv^ @ 

^Department of Urology, Shantou University Medical College, Shantou, Guangdong, China 
2Department of Urology, No 2. Affiliated Hospital of Shantou University Medical College, Shantou, 
Guangdong, China 

World Journal of Surgical Oncology 2004, 2:13 doi: 10.1186/1477-7819-2-13 

The electronic version of this article is the complete one and can be found online at: 

htta;i/w.ww,^^ 

Received 30 March 2004 
Accepted 10 May 2004 
Published 10 May 2004 



© 2004 Zhigang and Wenlv; licensee BioMed Central Ltd. This is an Open Access article: 
verbatim copying and redistribution of this article are permitted in all media for any purpose, 
provided this notice is preserved along with the article's original URL. 



Outline 



Keywords: Prostate, Neoplasm, Prostate stem cell antigen (PSCA) 

Abstract 



Abstract 
Introduction 
Materials and methods 

Results 
Discussion 
Competing interests 
References 



Background 

Prostate stenn cell antigen (PSCA) is a recently defined homologue of the Thy-l/Ly- 
6 family of glycosylphosphatidylinositol (GPI)-anchored cell surface antigens. The 
purpose of the present study was to examine the expression status of PSCA protein 
and mRNA in clinical specimens of human prostate cancer (Pea) and to validate it 
as a potential molecular target for diagnosis and treatment of Pea. 

Materials and Methods 



Immunohistochemical (IHC) and In situ hybridization (ISH) analyses of PSCA 
expression were simultaneously performed on paraffin-embedded sections from 20 
benign prostatic hyperplasia (BPH), 20 prostatic intraepithelial neoplasm (PIN) and 
48 prostate cancer (Pea) tissues. Including 9 androgen-independent prostate 
cancers. The level of PSCA expression was semiquantitatively scored by assessing 
both the percentage and Intensity of PSCA-posltive staining cells in the specimens. 
Then compared PSCA expression between BPH, PIN and Pea tissues and analysed 
the correlations of PSCA expression level with pathological grade, clinical stage and 
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progression to androgen-lndependence In Pea. 

Results 

In BPH and low grade PIN, PSCA protein and mRNA staining were weak or negative 
and less Intense and uniform than that seen In HGPIN and Pea. There were 
moderate to strong PSCA protein and mRNA expression In 8 of 11 (72.7%) HGPIN 
and In 40 of 48 (83.4%) Pea specimens examined by IHC and ISH analyses, with 
statistical significance compared with BPH (20%) and low grade PIN (22.2%) 
samples (p < 0.05, respectively). The expression level of PSCA increased with high 
Gleason grade, advanced stage and progression to androgen-lndependence (p < 
0.05, respectively). In addition, IHC and ISH staining showed a high degree of 
correlation between PSCA protein and mRNA overexpresslon. 

Conclusions 



Our data demonstrate that PSCA as a new cell surface marker is overexpressed by 
a majority of human Pea. PSCA expression correlates positively with adverse tumor 
characteristics, such as increasing pathological grade (poor ceil differentiation), 
worsening clinlcai stage and androgen-lndependence, and speculatively with 
prostate carcinogenesis. PSCA protein overexpresslon results from upregulated 
transcription of PSCA mRNA. PSCA may have prognostic utility and may be a 
promising molecular target for diagnosis and treatment of Pea. 
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Prostate cancer (Pea) is the second leading cause of cancer-related death in 
American men and is becoming a common cancer increasing in China. Despite 
recently great progress In the diagnosis and management of localized disease, 
there continues to be a need for new diagnostic markers that can accurately 
discriminate between indolent and aggressive variants of Pea. There also continues 
to be a need for the Identification and characterization of potential new therapeutic 
targets on Pea cells. Current diagnostic and therapeutic modalities for recurrent 
and metastatic Pea have been limited by a lack of specific target antigens of Pea. 



Although a number of prostate-specific genes have been identified (i.e. prostate 
specific antigen, prostatic acid phosphatase, glandular kalllkrein 2), the majority of 
these are secreted proteins not ideally suited for many immunological strategies. 
So, the Identification of new cell surface antigens Is critical to the development of 
new diagnostic and therapeutic approaches to the management of Pea. 

Reiter RE et al [1] reported the identification of prostate stem cell antigen (PSCA), 
a cell surface antigen that is predominantly prostate specific. The PSCA gene 
encodes a 123 amino acid glycoprotein, with 30% homology to stem cell antigen 2 
(Sea 2). Like Sca-2, PSCA also belongs to a member of the Thy-l/Ly-6 family and 
is anchored by a glycosylphosphatidyllnositol (GPI) linkage. mRNA in situ 
hybridization (ISH) localized PSCA expression in normal prostate to the basal cell 
epithelium, the putative stem cell compartment of prostatic epithelium, suggesting 
that PSCA may be a marker of prostate stem/progenitor cells. 



In order to examine the status of PSCA protein and mRNA expression in human Pea 



hitn*//www wifin r.nm/r.AntPsiitA7/l /I '^ 



11/16/9.004 



World Joumal of Surgical Oncology | Full text | Prostate stem cell antigen (PSCA) expression in human ... Page 3 of 11 



and validate It as a potential diagnostic and therapeutic target for Pea, we used 
Immunohlstochemlstry (IHC) and in situ hybridization (ISH) simultaneously, and 
conducted PSCA protein and mRNA expression analyses in paraffin-embedded 
tissue specimens of benign prostatic hyperplasia (BPH, n = 20), prostate 
Intraepithelial neoplasm (PIN, n = 20) and prostate cancer (Pea, n = 48), 
Furthermore, we evaluated the possible correlation of PSCA expression level with 
Pea tumbrigenesis, grade, stage and progression to androgen-independence. 
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Correlation of PSCA 
expression with clinical 
stage 



Tissue samples 

All of the clinical tissue specimens studied herein were obtained from 80 patients of 
57-84 years old by prostatectomy, transurethral resection of prostate (TURP) or 
biopsies. The patients were classified as 20 cases of BPH, 20 cases of PIN, 40 cases 
of primary Pea, including 9 patients with recurrent Pea and a history of androgen 
ablation therapy (orchiectomy and/or hormonal therapy), who were referred to as 
androgen-independent prostate cancers. Eight specimens were harvested from 
these androgen-independent Pea patients prior to androgen ablation treatment. 
Each tissue sample was cut into two parts, one was fixed in 10% formalin for IHC 
and the other treated with 4% paraformaldehyde/O.l M PBS PH 7.4 in 0.1% DEPC 
for 1 h for ISH analysis, and then embedded in paraffin. All paraffin blocks 
examined were then cut into 5 jim sections and mounted on the glass slides 
specific for IHC and ISH respectively In the usual fashion. H&E-stained section of 
each Pea was evaluated and assigned a Gleason score by the experienced 
urological pathologist at our institution based on the criteria of Gleason score [2]. 
The Gleason sums are summarized in Table 1. Clinical staging was performed 
according to Jewett-whitmore-prout staging system, as shown in Table 2. In the 
category of PIN, we graded the specimens into two groups, i.e. low grade PIN 
(grade I - II) and high grade PIN (HGPIN, grade III) on the basis of literatures 
[3,4]. 

Immunohlstochemlcal (IHC) analysis 

Briefly, tissue sections were deparaffinized, dehydrated, and subjected to 
microwaving in 10 mmol/L citrate buffer, PH 6.0 (Boshide, Wuhan, China) in a 900 
W oven for 5 min to induce epitope retrieval. Slides were allowed to cool at room 
temperature for 30 min. A primary mouse antibody specific to human PSCA 
(Boshide, Wuhan, China) with a 1:100 dilution was applied to incubate with the 
slides at room temperature for 2 h. Labeling was detected by sequentially adding 
blotlnylated secondary antibodies and strepavidin-peroxidase, and localized using 
3,3'-diaminobenzidine reaction. Sections were then counterstalned with 
hematoxylin. Substitution of the primary antibody with phosphate-buffered-saline 
(PBS) served as a negative-staining control. 



mRNA in situ hybridization (ISH) 

Five -jim -thick tissue sections were deparaffinized and dehydrated, then digested in 
pepsin solution (4 mg/ml in 3% citric acid) for 20 miri at 37.5°C, and further 
processed for ISH. Digoxigenln-labeled sense and antlsense human PSCA RNA 
probes (obtained from Boshide, Wuhan, China) were hybridized to the sections at 
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48®C overnight. The posthybridlzatlon wash with a high stringency was performed 
sequentially at 37°C in 2 x standard saline citrate (SSC) for 10 min, in 0.5 x SSC 
for 15 min and In 0.2 x SSC for 30 min. The slides were then incubated to 
biotinylated mouse anti-dlgoxigenin antibody at 37.5®C for 1 h followed by washing 
in 1 X PBS for 20 min at room temperature, and then to strepavidin-peroxidase at 
37,5°C for 20 min followed by washing in 1 x PBS for 15 min at room temperature. 
Subsequently, the slides were developed with diaminobenzidine and then 
counterstained with hematoxylin to localize the hybridization signals. Sections 
hybridized with the sense control probes routinely did not show any specific 
hybridization signal above background. All slides were hybridized with PBS to 
substitute for the probes as a negative control. 

Scoring methods 

To determine the correlation between the results of PSCA immunostaining and 
mRNA in situ hybridization, the same scoring manners are taken in the present 
study for PSCA protein staining by IHC and PSCA mRNA staining by ISH. Each slide 
was read and scored by two independently experienced urologicai pathologists 
using Olympus BX-41 light microscopes. The evaluation was done in a blinded 
fashion. For each section, five areas of similar grade were analyzed 
semiquantltatively for the fraction of cells staining. Fifty percent of specimens were 
randomly chosen and rescored to determine the degree of interobserver and 
intraobserver concordance. There was greater than 95% Intra- and Interobserver 
agreement. 

The intensity of PSCA expression evaluated microscopically was graded on a scale 
of 0 to 3+ with 3 being the highest expression observed (0, no staining; 1+, mildly 
Intense; 2+, moderately intense; 3+, severely intense). The staining density was 
quantified as the percentage of cells staining positive for PSCA with the primary 
antibody or hybridization probe, as follows: 0 = no staining; 1 = positive staining in 
<25% of the sample; 2 = positive staining in 25%-50% of the sample; 3 = 
positive staining in >50% of the sample. Intensity score (0 to 3+) was multiplied 
by the density score (0-3) to give an overall score of 0-9 [1,5]. In this way, we 
were able to differentiate specimens that may have had focal areas of increased 
staining from those that had diffuse areas of increased staining [6], The overall 
score for each specimen was then categorically assigned to one of the following 
groups: 0 score, negative expression; 1-2 scores, weak expression; 3-6 scores, 
moderate expression; 9 score, strong expression. 

Statistical analysis 

Intensity and density of PSCA protein and mRNA expression in BPH, PIN and Pea 
tissues were compared using the Chi-square and Student's t-test. Univariate 
associations between PSCA expression and Gieason score, clinical stage and 
progression to androgen-lndependence were calculated using Fisher's Exact Test. 
For all analyses, p < 0.05 was considered statistically significant. 



Outline Results 

Abstract PSCA expression in BPH 

Introduction 
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Figures 




Figure 1 

Representatives of PSCA 
IHC and ISH staining In Pea 
(A. IHC staining, B. ISH 
staining, x200 
magnification) 



In general, PSCA protein and mRNA were expressed weakly in individual samples of 
BPH. Some areas of prostate expressed weal< levels (composite score 1-2), 
wtiereas other areas were completely negative (composite score 0). Four cases 
(20%) of BPH liad moderate expression of PSCA protein and mRNA (composite 
score 4-6) by IHC and ISH, In 2/20 (10%) BPH specimens, PSCA mRNA expression 
was moderate (composite score 3-6), but PSCA protein expression was weai< 
(composite score 2) in one and negative (composite score 0) in the other. PSCA 
expression was localized to the basal and secretory epithelial cells, and prostatic 
stroma was almost negative staining for PSCA protein and mRNA In all cases 
examined. 

PSCA expression in PIN 

In this study, we detected weak or negative expression of PSCA protein and mRNA 
(<2 scores) In 7 of 9 (77.8%) low grade PIN and In 2 of 11 (18.2%) HGPIN, and 
moderate expression (3-6 scores) in the rest 2 low grade PIN and 5 of 11 (45.5%) 
HGPIN. One HGPIN with moderate PSCA mRNA expression (6 score) was found 
weal< staining for PSCA protein (2 score) by IHC. Strong PSCA protein and mRNA 
expression (9 score) were detected in the remaining 3 of 11 (27.3%) HGPIN. There 
was a statistically significant difference of PSCA protein and mRNA expression 
levels observed between HGPIN and BPH (p < 0.05), but no statistical difference 
reached between low grade PIN and BPH (p > 0.05). 



PSCA expression in Pea 



In order to determine if PSCA protein and mRNA can be detected in prostate 
cancers and If PSCA expression levels are increased in malignant compared with 
benign glands, Forty-eight paraffin-embedded Pea specimens were analysed by IHC 
and ISH. It was shown that 19 of 48 (39.6%) Pea samples stained very strongly for 
PSCA protein and mRNA with a score of 9 and another 21 (43.8%) specimens 
displayed moderate staining with scores of 4-6 (Figure 1). In addition, 4 specimens 
with moderate to strong PSCA mRNA expression (scores of 4-9) had weal< protein 
staining (a score of 2) by IHC analyses. Overall, Pea expressed a significantly 
higher level of PSCA protein and mRNA than any other specimen category in this 
study (p < 0.05, compared with BPH and PIN respectively). The result 
demonstrates that PSCA protein and mRNA are overexpressed by a majority of 
human Pea. 



Correlation of PSCA expression witii Gleason score in Pea 

Using the semi-quantitative scoring method as described in Materials and Methods, 
we compared the expression level of PSCA protein and mRNA with Gleason grade 
of Pea, as shown in Table 1. Prostate adenocarcinomas were graded by Gleason 
score as 2-4 scores = well-differentiation, 5-7 scores = moderate-differentiation 
and 8-10 scores = poor-differentiation [7]. Seventy-two percent of Gleason scores 
8-10 prostate cancers had very strong staining of PSCA compared to 21% with 
Gleason scores 5-7 and 17% with 2-4 respectively, demonstrating that poorly 
differentiated Pea had significantly stronger expression of PSCA protein and mRNA 
than moderately and well differentiated tumors (p < 0.05). As depicted In Figure 1, 
IHC and ISH analyses showed that PSCA protein and mRNA expression in several 
cases of poorly differentiated Pea were particularly prominent, with more intense 
and uniform staining. The results indicate that PSCA expression Increases 
significantly with higher tumor grade in human Pea. 
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Correlation of PSCA expression with clinical stage iii Pea 

with regards to PSCA expression in every stage of Pea, we showed the resuits in 
Table 2. Seventy-five percent of locally advanced and node positive cancers (i.e. C- 
D stages) expressed statistically high levels of PSCA versus 32.5% that were organ 
confined (i.e. A-B stages) (p < 0.05). The data demonstrate that PSCA expression 
increases significantly with advanced tumor stage in human Pea. 

Correlation of PSCA expression with androgen-independent 
progression of Pea 

All 9 specimens of androgen-independent prostate cancers stained positive for 
PSCA protein and mRNA. Eight specimens were obtained from patients managed 
prior to androgen ablation therapy. Seven of eight (87.5%) of these androgen- 
independent prostate cancers were in the strongest staining category (score = 9), 
compared with three out of eight (37.5%) of patients with androgen-dependent 
cancers (p < 0.05). The results demonstrate that PSCA expression increases 
significantly with progression to androgen-independence of human Pea. 

It is evident from the results above that within a majority of human prostate 
cancers the level of PSCA protein and mRNA expression correlates significantly with 
increasing grade, worsening stage and progression to androgen-independence. 

Correlation of PSCA immunostaining and mRNA in situ 
hybridization 

In all 88 specimens surveyed herein, we compared the results of PSCA IHC staining 
with mRNA ISH analysis. Positive staining areas and Its intensity and density scores 
evaluated by IHC were Identical to those seen by ISH in 79 of 88 (89.8%) 
specimens (18/20 BPH, 19/20 PIN and 42/48 Pea respectively). Importantly, 27/27 
samples with PSCA mRNA composite scores of 0-2, 32/36 samplers with scores of 
3-6 and 22/24 samples with a score of 9 also had PSCA protein expression scores 
of 0-2, 3-6 and 9 respectively. However, in 5 samples with PSCA mRNA overall 
scores of 3-6 and in 2 with scores of 9 there were less or negative PSCA protein 
expression (I.e. scores of 0-4), suggesting that this may reflect posttranscrlptional 
modification of PSCA or that the epitopes recognized by PSCA mAb may be 
obscured in some cancers. The data demonstrate that the results of PSCA 
immunostaining were consistent with those of mRNA ISH analysis, showing a high 
degree of correlation between PSCA protein and mRNA expression. 
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PSCA is homologous to a group of ceil surface proteins that mark the earliest phase 
of hematopoietic development. PSCA mRNA expression is prostate-specific In 
normal male tissues and Is highly up-regulated in both androgen-dependent and- 
independent Pea xenografts (IJVPC-4 tumors). We hypothesize that PSCA may play 
a role in Pea tumorigenesis and progression, and may serve as a target for Pea 
diagnosis and treatment. In this study, IHC and ISH showed that in general there 
were weak or absent PSCA protein and mRNA expression In BPH and low grade PIN 
tissues. However, PSCA protein and mRNA are widely expressed In HGPIN, the 
putative precursor of invasive Pea, suggesting that up-regulatlon of PSCA is an 
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early event in prostate carcinogenesis. Recently, Reiter RE et al [1], using ISH 
analysis, reported that 97 of 118 (82%) HGPIN specimens stained strongly positive, 
for PSCA niRNA. A very similar finding was seen on mouse PSCA (mPSCA) 
expression in mouse HGPIN tissues by Iran C. P et al [8]. Tlnese data suggest tliat 
PSCA may be a new marker associated with transformation of prostate cells and 
tumorlgenesls. 

We observed that PSCA protein and mRNA are highly expressed in a large 
percentage of human prostate cancers, including advanced, poorly differentiated, 
androgen-lndependent and metastatic cases. Fluorescence-activated cell sorting 
and confocal/ immunofluorescent studies demonstrated cell surface expression of 
PSCA protein in Pea cells [9]. Our IHC expression analysis of PSCA shows not only 
cell surface but also apparent cytoplasmic staining of PSCA protein in Pea 
specimens (Figure 1). One possible explanation for this is that anti-PSCA antibody 
can recognize PSCA peptide precursors that reside in the cytoplasm. Also, it Is 
possible that the positive staining that appears in the cytoplasm is actually fronn 
the overlying cell membrane [5], These data seem to Indicate that PSCA is a novel 
cell surface marker for human Pea. 

Our results show that elevated level of PSCA expression correlates with high grade 
(i.e. poor differentiation), increased tumor stage and progression to androgen- 
independence of Pea. These findings support the original IHC analyses by Gu Z et al 
[9], who reported that PSCA protein expressed In 94% of primary Pea and the 
intensity of PSCA protein expression increased with tumor grade, stage and 
progression to androgen-independence. Our results also collaborate the recent 
work of Han KR et al [10]/ in which the significant association between high PSCA 
expression and adverse prognostic features such as high Gleason score, seminal 
vesicle invasion and capsular involvement in Pea was found. It Is suggested that 
PSCA overexpression may be an adverse predictor for recurrence, clinical 
progression or survival of Pea. Hara H et al [11] used RT-PCR detection of PSA, 
PSMA and PSCA in 1 ml of peripheral blood to evaluate Pea patients with poor 
prognosis. The results showed that among 58 PCa patients, each PCR indicated the 
prognostic value in the hierarchy of PSCA>PSA>PSMA RT-PCR, and extraprostatic 
eases with positive PSCA PCR indicated lower disease-progression-free survival 
than those with negative PSCA PCR, demonstrating that PSCA can be used as a 
prognostic factor. Dubey P et al [12] reported that elevated numbers of PSCA + 
cells correlate positively with the onset and development of prostate carcinoma 
over a long time span In the prostates of the TRAMP and PTEN +/- models 
compared with its normal prostates. Taken together with our present findings. In 
which PSCA is overexpressed from HGPIN to almost frank carcinoma, it Is 
reasonable and possible to use Increased PSCA expression level or increased 
numbers of PSCA-positive cells in the prostate samples as a prognostic marker to 
predict the potential onset of this cancer. These data raise the possibility that PSCA 
may have diagnostic utility or clinical prognostic value In human Pea. 

The cause of PSCA overexpression in Pea is not known. One possible mechanism is 
that It may result from PSCA gene amplification. In humans, PSCA Is located on 
chromosome 8q24.2 [1], which Is often amplified in metastatic and recurrent Pea 
and considered to indicate a poor prognosis [ 13-15 ]. Interestingly, PSCA is in close 
proximity to the c-myc oncogene, which is amplified In >20% of recurrent and 
metastatic prostate cancers [16,17]. Reiter RE et al [18] reported that PSCA and 
MYC gene copy numbers were co-amplified In 25% of tumors (five out of twenty), 
demonstrating that PSCA overexpression Is associated with PSCA and MYC 
coampllflcatlon In Pea. Gu Z et al [S] recently reporteted that In 102 specimens 
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available to compare the results of PSCA Immunostaining with their previous mRNA 
ISH analysis, 92 (90.2%) had Identically positive areas of PSCA protein and mRNA 
expression. Taken together with our findings, in which we detected moderate to 
strong expression of PSCA protein and mRNA in 34 of 40 (85%) Pea specimens 
examined simultaneously by IHC and ISH analyses, it Is demonstrated that PSCA 
protein and mRNA overexpressed In human Pea, and that the increased protein 
level of PSCA was resulted from the upregulated transcription of Its mRNA. 

At present, the regulation mechanisms of human PSCA expression and Its biological 
function are yet to be elucidated. PSCA expression may be regulated by multiple 
factors [18]. Watabe T et a! [19] reported that transcriptional control is a major 
component regulating PSCA expression levels. In addition, induction of PSCA 
expression may be regulated or mediated through cell-cell contact and protein 
kinase C (PKC) [20]. Homologues of PSCA have diverse activities, and have 
themselves been Involved in carcinogenesis. Signalling through SCA-2 has been 
demonstrated to prevent apoptosis in Immature thymocytes [21]. Thy-1 is involved 
in T cell activation and transducts signals through src-like tyrosine kinases [22]. 
Ly'6 genes have been implicated both in tumorigenesis and in cell-cell adhesion 
[ 23-2 5]. Cell-cell or cell-matrix Interaction is critical for local tumor growth and 
spread to distal sites. From its restricted expression In basal cells of normal 
prostate and Its homology to SCA-2, PSCA may play a role in stem/progenitor cell 
function, such as self-renewal (i.e. antl-apoptosis) and/or proliferation [1]. Taken 
together with the results in the present study, we speculate that PSCA may play a 
role in tumorigenesis and clinical progression of Pea through affecting cell 
transformation and proliferation. From our results, it is also suggested that PSCA as 
a new cell surface antigen may have a number of potential uses In the diagnosis, 
therapy and clinical prognosis of human Pea. PSCA overexpression in prostate 
biopsies could be used to identify patients at high risk to develop recurrent or 
metastatic disease, and to discriminate cancers from normal glands in 
prostatectomy samples. Similarly, the detection of PSCA-overexpressing cells in 
bone marrow or peripheral blood may Identify and predict metastatic progression 
better than current assays, which identify only PSA-positive or PSMA-positive 
prostate cells. 

In summary, we have shown In this study that PSCA protein and mRNA are 
maintained In expression from HGPIN through all stages of Pea in a majority of 
cases, which may be associated with prostate carcinogenesis and correlate 
positively with high tumor grade (poor cell differentiation), advanced stage and 
androgen-lndependent progression. PSCA protein overexpression is due to the 
upregulation of its mRNA transcription. The results suggest that PSCA may be a 
promising molecular marker for the clinical prognosis of human Pea and a valuable 
target for diagnosis and therapy of this tumor. 
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Abstract 

Translation initiation is regulated In respmse to 
nutHent avaflablTity and mitogenic stimulation and is 
coupled with ceil cycle progression and cell gnswth. 
Several alterations In translatlonal control occur In 
cancer. Variant mRNA sequences can alter the 
translationai efficiency of individual mRNA molecules, 
which in turn play a role in cancer biology. Changes in 
the exprssslon or availability of cotnpomnts of the 
translatlonal machinery and In ttie activation of 
translation through signal transduction pathways can 
lead to more global changes, such as an increase In 
the overall rate of protein synthesis and translatlonal 
acthmtion of the mRNA molecules involved in cell 
growth and proliferation. We review the basic 
principles of translationai control, the aRerafions 
encountered hi cancer, and selected therapies 
targeting translation initiation to help elucidate new 
therapeutic avenues. 

introduction 

The ftindamental principle of molecular therapeutics in can- 
cer is to exploit the differences In gene expression between 
cancer cells and normal cells. With the advent of cDNA array 
technology, most efforts have concentrated on Identifying 
differences In gene expression at the level of mRNA, which 
can be attributable either to DMA amplification or to differ- 
ences In transcription. Gene expression Is quite complicated, 
however, and Is also regulated at the level of mRNA stability. 
mRNA translation, and protein stability. 

The power of translationai regulation has been best recog- 
nized among devetopmentaJ biologists, because transcription 
does not occur In early emtwyogenesls In eukaryotes. For ex- 
ample. In Xenopus, the period of transcriptional quiescence 
continues until the embryo reaches mldblastula transition, the 
4000-cell stage. Therefore, all necessary mRNA molecules are 
transcribed during oogenesis and stockpiled In a translationally 
Inactive, masked fonm. The mRNA are translationany activated 
at appropriate times during oocyte maluratkxi, fertilization, and 



earty embryogenesis and thus, are under strict translatlonal 
control. 

Translation has an established role In cell gmwth. Basi- 
cally, an increase In protein synthesis occurs as a conse* 
quence of mHogenesls. Until recently, however, llttie was 
knowr) about the alterations in mRNA translation in cancer, 
and mudi Is yet to be discovered about their role In the 
development and progression of cancer. Here we review the 
basic principles of translattdnal control, the alterations en* 
countered in cancer, and selected therapies targeting transla- 
tion initiatibn to etucklate potenty new therapeutic avenues. 

Basic Principles <tf Translationai Control 
Mechanism of Translation Inltlathn 

Translation InHiatkxi is the main step in translatlonal regulation. 
Translatbn Initiatkm is a complex prDces^ In which the Initiator 
tRNA and the 40S and 60S ribosomal subunHs are recnjited to 
the 5' endof a mtV4A moltoile and assembled tiyeukaryotic 
transiajtion initiatton factors into an SOS ribosome at the start 
codon of the mRNA (Fig. 1 ). The 5' end of eukaryotfc mRNA Is 
capped, /.SL, contains the cap stnjcture m^GpppN (7-methyl- 
guanosine-triphospho-5'-ribonucleoslde). Most translation In 
eukaryotes occurs In a cap-dependent fashton, /.e, the cap is 
specifically recognized by the elF4E,^ which binds the 5' cap. 
The elF4F translation initiation complex Is then fomied the 
assembly of elF4E. the RNA helicase elF4A, and elF4G, a 
scaflblding protein that mediates the binding of the 40S ribo- 
somal subunit to the mRNA molecule through interactk}n with 
the elF3 protein present on the 40S ribosome. elF4A and eIRB 
parttetpate bi nielting the secondary structure Gf the 5' UTR of 
the mRNA- The 43S Initiation complex (40S/elF2/Met-tRNA/ 
GTP oomple}^ scans the mRNA in a 5'-^' direction until ft 
enoounters an AUG start codon. This start codon is then base- 
paired to the anticodon of Initiator tRNA, fbnning the 48S Initi- 
ation complex. The Initiation fectoie are then displaced from the 
4dS complex, and the 60S ribosome joins to fonm the 80S 
ribosome. 

Unlike most eukaryotic translation, translation initiation of 
certain mRNAs, such as the picomavinis RNA, Is cap Inde- 
pendent and occurs t)y intemal ribosome entry. This mecha- 
nism does not require elF4E. Bther the 438 complex can bind 
the Initiation codon directly through Interaction with fine IRES In 
the 5' UTR such as In the encephalomyocarditis virus, or It can 
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Ftg, U Translation initiation In eultaiyotes. The 4E-BP8 are hypefphos- 
phoiylated to reloaseelF4E so that it can Interact with the 5' cap, and the 
elF4F biMation complex is assembled. The interaction of poly(A) landing 
protein with the initiation oomptex and drcularization of the mRNA Is not 
depleted in the diagram. The secondary stnicture <^ the 6' liTR Is melted, 
the 40S ribosomal subunit Is bound td elF3. and the ternary complex 
consisting of elF2, GTP. and the Met-tRNA we recniited to the mRNA. The 
ribosome scans the mRNA In a 5'-^^' direction until an AUG start codon 
Is found In the appropriate sequence context TTie Initiation fectors are 
released, and the large ribosomal subunit Is recniited. 



Initially attadi to tlie IRES and tlien reach tiie initiation codon by 
scanning or tiBnsfer, as is the case with the poliovirus (1). 

Regutation of Translation InKation 
Translation initiation ton be regulated by alteraHons In the 
expression or phosphorylation status of the various factors 
Ihvdved. Key componenis In translatlonal regulation that 
may provide potential therapeutic targets follow, 

elF4E. elF4E plays a central rDle in translation regulation. 
It is the least abundant of the Initiation factors and Is con- 
sidered the rate-limiting component for Initiation of cap- 
dependent translation. elF4E may also be Involved in mRNA 
splicing, mRNA 3' processing, and mRNA nucleocytoptas- 
"^c t ransp ort 0. elF4E expression can be Increased at the 
transcriptional level In response to seoim or growth factors 
(3). elF4E overexpresslon may cause preferential translation 
of mRNAs containing excessive secondary structure in their 
5' UTR that are normally dlscrinrUnated against by the trans- 



latlonal machinery and thus are Inefficiently translated (4-7). 
As examples of this, overexpresslon of elF4E promotes in- 
creased translation of vascular endothelial growth factor, 
fibroblast growth fector-2, and cyclln D1 8, 9). 

Another mechanism of control Is the regulation of elF4E 
phosphofylatioa elF4E phosphorylation Is mediated by the 
mitogen-actlvated protein Wnase-lnteracting kinase 1, which 
Is activated by the mitogen-actlvated pathway activating 
extracellular slgnal-nelated kinases and the stress-activated 
pathway acting through p38 mitogen-activated protein ki- 
nase (10-13), Several mitogens, such as senim, platelet- 
derived growth factor, epkjemial growth factor, insulin, 
^glotensln II, src kinase overracpresslon, and ras over- 
expression, lead to elF4E phosphorylation (14). The phos- 
phorylation status of elF4E Is usually comeialed with the 
translatlonal rate and growth status of the cell; however; 
elF4E phosphorylation has also been observed In response 
to som^ cellular stresses when translatlonal rates actually 
decrease (15), Thus, further study Is needed to understand 
the effects of elF4E phosphorylation on elF4E activity. 

Another mechanism of regulation is the alteration of eiF4E 
availability by the binding of elF4E to the elF4E-blnding pro- 
teins (4E-BP, also known as PHAS-I). 4E-BPs compete with 
elF4G for a binding site in elF4E, The binding of elF4E to the 
best characterized elF4E-blndIng protein, 4E-BP1, Is regu- 
lated by 4E-BP1 phosphorylatkwi. hlypophosphoryiated 4E- 
BP1 binds to elF4E, whereas 4E-BP1 hyperphosphorylalton 
decreases this binding. Insulin, angiotensin, epidermal 
growth factor, platelet-derived qtom^ factor, hepatocyte 
growth factor, nerve growth factor, insulin-like growth factors 
I and II. InterieuWn 3, prahulocyte-macrophage colony-stim- 
ulating factor + steel factor, gastrin, and the adenovirus have 
all been reported to Induce phosphorylation of 4E-BP1 and 
to decrease the ability of 4E-BP1 to bind elF4E (15, 16). 
Conversely, deprivation of nutrients or growth factors results 
in 4E-BP1 dephosphoiylatlon, an increase In elF4E binding, 
and a decrease In cap-dependent translation. 

p70 86 Kinase. Phosphorylation of ribosomal 40S protein 
S6 by S6K Is thought to play an innportant role in translational 
regulation. S6K mouse embryonic cells proliferate more 
slowly than do parental cells, demonstrating that S6K has a 
positive Influence on cell proliferation (1 7). S6K regulates the 
translation of a group of mRNAs possessing a 5' terminal 
ollgopyrimidlne tract ^' TOP) found at the 5' UTR of ribosomal 
protein mRNAs and other mRNAs coding for components of 
the translattonal machinery. Phosphorylation of S6K Is regu- 
lated In part based on the avaOabUity of nutrients (18, 19) and Is 
stimulated by several growth factors, such as platelet-derived 
growtii factor and Insulin-IIke growQi factor I (2(9. 

elF2a Phosphorylation. The binding of the Initiator tRNA 
to the small ribosomal unit is mediated by translation Initia* 
tlon factor elF2. Phosphorylation of the a-subunit of elF2 
prevents fonfnatton of the ell^GTP/Met-tRNA complex and 
lnhR)tts global protein synthesis (21, 2^. elF2a Is phospho* 
rylated under a variety of conditions, such as viral Infection, 
nutrfant deprivation, heme deprivation, and apoptoals (22). 
elF2a Is phosphorylated by heme-TBgulated Inhlbftor, nutrient- 
regulated protein kinase, and the IFN-tnduced, double- 
stranded RNA-activated protein kinase (PKR; Ref. 23). 
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The rtiTOR SignaHng Pathway, The macrollde antibiotic 
mpamydn (Sirallmus; Wyeth-Ayerst Research, Coilegevllle. 
PA) has been the subject of Intensive study because it In- 
hibits signal transduction pathways Involved in T-cell activa- 
tion. The rapanfrycln-senslth^e component of these pathways 
Is mTOR (also called FRAP or RAFT1). mTOR is the mam- 
malian homologue of the yeast TOR pnsteins that regulate 
progression and translation in rssponse to nutrient availabil- 
ity (24). mTOR Is a serine-threonine kinase that modulates 
translation Initiation by altering the phosphorylation stahjs of 
4E-BP1 and S6K (Fig. 2; Ref. 25). 

4E-BP1 is phosphoiylated on multiple residues. mTOR phos- 
phorylates the Thr-37 and 7hr-46 residues of 4E-BP1 h Mfto 
(2^ howev^, phosph(»7lation at these sites Is not associated 
with a loss of elF4E binding. Phosphorylation of Thr-37 and 
Thr-46 is required for subsec^ient phosphorylation at several 
COOIftenninaf, serum-sensitive sAes; a combination of these 
phosphorylaUon events appears to be needed to inhibit the 
binding of 4E-BP1 to elF^ The product of the i^TM gene, 
p3a/MSK1 pathw^, andpnstelnidnaseCa'alsoplayarolein 
4E-BP1 phosphorylation ^7-2^. 

S6Kand 4E-BP1 are also regulated, in part, by PI3K and its 
downstream protein Idnase Alct. PTEN Is a phosphatase that 
negatively regulates PI3K signaling. PTEN null cells have 
oonstitutively active of Akt, vtrtth increased S6K acthmy and 
86 phosphorylation (30). 86K activity Is bthibited both by 
PI3K inhlbitofs wortmannin and LY294002 and by mTOR 
Inhibitor rapamycin (24). Alct phosphoryfates Ser-2448 In 
mTOR in vftro, and this site Is phosphoryiated upon Akt 
activation in vivo (31-^). Thus, mTOR Is regulated by the 
PI3K/Akt pathway; however, this does riot appear to be the 
only mode of regulation of mTOR activity. Whether the Pi3K 
pathway also regulates S6K and 4E-BP1 phosphorylation 
independent of mTOR is controversial. 

Interestingly, mTOR autophosphorylation is bkKk&di by wort- 
nriannin but not by raparnydn (34). This seenrilhg inconste^^ 
suggests that mTORnesponslve regulation of 4E-BP1 and S6K 
activity occurs through a mechanism other than Intrinsic mTOR 
Idnase activity. An alternate pathway for 4E-BP1 and S6K phos- 
phorylation by mTOR activi^ is by the inhibition of a phospha- 
tase. Treatment witii calycuiin A, an inhitritor of phospiiatases 1 
and 2A, reduces rapamydn-lnduced dephosphofyiation of 4E- 
BP1 and S6K by rapamycin (35). Pi^ Interacts with fulHengtti 
86K but not with a S6K mutant ttiat is resistant to dephospho- 
ryiation resulting from rapamycin. mTOR phosphorylates PP2A 
In vitro; however, how this process altens PP2A activity Is not 
l<nown. These results are con^stent witii the model that phos- 
phorylation of a phosphatase by mTOR prevents dephospho- 
rylation of 4E-BP1 and S6K, and conversely, ttiat nutrient dep- 
rivation and rapamycin block inhibition of the phosphatase by 
mTOa 

Polywienylation* The poiy(A) tan In eukaryotic mRNA Is 
important In enhancing translation Initiation and mRNA sta- 
bility. Polyadenylation plays a i<ey role in regulating gene 
expression during oogenesis and eariy embryogenesis. 
Some mRNA that are translafttonally inactive In the oocyte are 
polyadenylated concomitantly with translational activation in 
oocyte maturation, whereas otii^ mRNAs that are transla- 
tionally active during oogenesis are deadenylated and trans- 
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Fig. 2, Regulation of translation inftiation by signal transduction patti- 
wgg. Sfenafing vte p38, axtiaceflutar sIgnal-relalBd kinase, PI3K, and 
mTOR can afl activate translation Initiation. 



iationally silenced (36-^). Thus, control of poly(A) tali syn- 
thesis is an Important regulatory step In gene ex|3resslon. 
The 5' cap and poWi tail are thought to function synergls- 
tically to regulate mRNA translational efRciency (39, 40). 

RNA Packaging. Most RIMA-blndlng proteins are assem- 
bled on a transcript at ttie time of transcription, tiius deter- 
mining the translational fate of the transcript (41). A highly 
conserved family of Y-box proteins Is found in cytoplasmic 
messenger ribonucieoprotein particles, where the proteins 
are thought to play a role in restricting tite recruitment of 
mRNA to the translational machinery (41-43), The major 
mRr4A-associated protein, YB-1, destabilizes the Interaction 
of elF4E and the 5' mRNA cap in vitro, and overexpression of 
YB-1 results In translational repression in vivo (44). Thus, 
alterations In RNA packaging can also play an important role 
in translational regulation. 

Translation Alterations Encountered in Cancer 

Three main alterations at tiie translational level occur In cancer 
variations In mRNA sequences tfiat increase or decrease trans- 
lational efficiency, changes In tiie ©cpresslon or avaflabllity of 
components of tiie translational machinery, and activation of 
translation tiirough abenantiy activated signal transduction 
patiiways. The first alteration affects tiie translation of an indi- 
viduai mRfsIA that may play a role In carcinogenesis. The sec- 
ond and tiiird alterations can lead to more global changes, such 
as an Increase in tiie overall rate of protein syntiiesis, and the 
translational activation of several mRNA spedes. 

Variations In mRNA Sequence 
Variations in mRNA sequence affect tiie translational effi- 
ciency (rf the transcript A brief description of these variations 
and examples of each mechanism foUow. 

MutaUons. Mutations In the mRNA sequence, especially 
in the 5' UTR, can alter Its transiational efficiency, as seen in 
the foltowing examples. 
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o-mjfc Salto eta/, propel that translation of full-length 
(Hnttyo is repressed, whereas In several BurWtt lymphomas 
that have deletions of the mRNAS' UTR. translation dlomyc 
b nrrone efficient (46). More recently, it was reported that the 
5' UTR of c*myc contains an IRES, and thus cnmyc transla- 
tion can t)e Initiated by a cap-Independent as well as a 
cap-dependent mechanism (46, 47). In patients with multiple 
myeloma, a C->T mutation In ttie c-myc IRES was identified 
(48) and found to cause an enhanced initiation of translation 
via internal rfbospma! entry (49). 

BRCAU A somatic point mutation (117 G-^C) in position 
-3 with respect to the start codon of the BRGA1 gene was 
identified In a highly aggressive sporadic tireast cancer (50). 
Chimeric constructs consisting of the wild-type or mutated 
BRCA1 5' UTR and a downstream luciferase reporter dem- 
onstrated adecrease In the translatk)na] effk>lency with the 5' 
UTR mutation. 

Cycttn-dependent Kinase Inhibitor 2A. Some inherited 
melanoma kindreds have a Q^T transversion at base -34 
of cyclin-dependent Idnase lnhlbltor-2A, which encodes a 
c/ciin-dependent Idnase 4/cyclin-dependent idnase 6 Idnase 
Inhlljltor important in checiqx>lnt rsgulaUon (51). This 
mutatloa glves rise to a novel AUG translation inltlatton 
codon« creating an upstream open reading frame that com- 
petes for scanning ribosomes and decreases translation 
ftom the wild-type AUG. 

Aftemate Spficing and Alternate Transcription Start 
Sites. AHeraHons In spfidng and alternate trsuiscription sites 
can lead to variations \n 5' UTR sequence, length, and second- 
aiy structive, ultimateiy impacting translational efficiency. 

ATM. The ATM gene has four noncoding exons In its 5' 
UTR that undergo extensive altemative splicing p2). The 
contents of 12 dlflerent 5' UTRs that show considerable 
diversity In length and sequence have been identified. These 
divergent 5' leader sequences play an important role In the 
translational regulation of the ATM gene. 

mdnu In a sut)set of tumors, overexpression of the onco- 
protein mdm2 results in enhanced translation of the mdm2 
mRh4A. Use of different promoters leads to two mdm2 tran- 
scripts that differ only in their 5' leaders (63). The longer 5' 
UTR contains two upstream open reading frames, and this 
mRNA is loaded with ribosomes inefficiently compared with 
the short 5' UTR. 

BRCA1. In a normal mammary gland, BRCA1 mRHA is 
expressed with a shorter leader sequence (5'UTRa), whereas 
insporadlcbreastcancertissue,BWG4r mRMA Is expressed 
with a longer leader sequence (5' UTRb); the translational 
efficiency of transcripts containing 5' UTRb is 1 0 times lower 
than that of transcripts containing 5' UTRa p4). 

mRNA Includes a 1.1-kb 5' UTR. which 
exerts an Inhibitory effect on translation. Many human breast 
cancer cell lines contain a novel TQF-pS transcript with a 5' 
UTR that Is 870 nucleotides shorter and has a7-fr>ld greater 
translational efficiency than the normal TQF-p3 mRNA (55). 

Alternate Polyadenylatlon Sites. i\/1ultipie polyadenyi- 
ation signals leading to the generation of several transcripts 
with differing 3' UTR have been described for several mRNA 
species, such as the WET proto-oncogene f56), ATM gene 
(5^, tissue Inhibitor of metaiioproteinases-^ (57), RHOA 



proto-oncogene (5Q, and calmodulin-l (59). Although the 
effect of these alternate 3' UTRs on translation is not yet 
known, they rnay be important In RNA-protejn interactions 
that affect translational recruitment The role of these alter- 
ations In cancer development and progression is unlmown. 



Alterations In the Components of the 
Trmsbthn Machinery 

ARmrtions in the components of translation machinery can 
take many forms. 

Overexpresssion of elM& Overexpression of elF4E 
causes malignant transfbmraatbn In rodent cells (60) and the 
daegulation of HeLa cell growth (61). Polunovsky etal. (62) 
found that elF4E overexpression substitutes for serum and 
individual growth factors in presendng viability of fibroblasts, 
which suggests that elF4E can mediate both proliferative and 
sunftol signaling. 

Bevated levels of elF4E mRNA have been found in a broad 
spectmm of transfbrmed ceil lines (63). eiF4E levels are 
elevated in ail ductal carcinoma In situ specimens and Inva- 
sive ductal cardnomasr compared with benign breast spec- 
imens evaluated with Western blot analysis (64, 65). Prelim- 
inary studies suggest that this overexpressk>n is attributable 
to gene amplification (66). 

There are accumulating data suggesting that elF4E overex- 
presston can be valuable as a prognostic martyr. eiF4E over- 
expressfon v^ras found In a retrospective study to be a martcer of 
poor prognosis In stages I to III breast carcinoma (67). Verifica- 
tion of the prcgnostte value of elF4E in br^ cancer Is now 
under way In a prospective trial ©7). However, fri a different 
study, elF4E expression was conrelaled with the c^gressive 
behavtor of non-Hodgkin's lymphomas (68). In a prospective 
analysis of patients vwth head and neck cancer, elevated levels 
of elF4E in histotogically tumor-free surgical margins predated 
a 8ignlfk»ntly Increased risk of kx:alHegional recurrence (9). 
These results an suggest that elF4E overexpression can be 
used to select patients who might benefit from more aggressive 
systemic therapy. Furthermore, the head and neck cancer data 
suggest that eiF4E overexpression is a fiekj defect and can be 
used to gukle local therapy. 

Alterations In Other initiation Factors. Alterations In a 
number of other initiation factors have been associated with 
cancer. Overproductton of elF4Q. similar to eiF4^ leads to 
malignant transfonnation In vftm (69). eiF-2a Is found In 
Increased levels in bronchloloalveolar carcinomas of the lung 
(3). Initiation factor eiF-4A1 is overexpressed in melanoma 
(7(9 and hepatocellutar carcinoma (71). The p40 subunit of 
translation Initiation factor 3 is amplified and overexpressed 
In breast and prostate cancer (72), and the elF^-p1 1 0 subunit 
is overexpressed in testicular seminoma (73). The role that 
overexpression of these inltlatton factors plays on the devel- 
opment and progressten of cancer. If any, is not knovm, 

Overexpression of S6K. S6K is amplified and highly 
overexpressed In the MCF7 breast cancer cell line, com- 
pared with nonnal mammary epithelium (74). In a study by 
Bariund ef af. (74), S6K was amplified In 59 of 668 primary 
breast tumors, and a statistically significant assodatlon was 
obsen^ed between amplification and poor prognosis. 
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Overexpression of PAP. PAP catalyzes 3' poIy{/=^ syn- 
thesis. PAP Is overexpressed in human cancer cells com- 
pared with nomial and virally transfbmied cells (76). PAP 
en2ymatlc activity in breast tumors has t>een comelated with 
PAP protein levels (76) and. In mammary tumor cytosols, was 
found to be an independent factor for fwedlcting sun^val (76). 
Uttte is loiown, however, about how PAP expression or ao* 
tivlty affects the translationat profile. 

Alterations In RNA-blnding Proteins. Even less Is known 
about attenartions in RNA padcaging in canc^. Increased ex- 
pression and nuclear localization of the RNA-blnding protein 
YB-1 are Incficators of a poor pncgnosls for breast cancer (77). 
non-ernaB oefl lung canPKT (71Q, and ovarian canoes 
ever, this effect may be mediated at least in part at the level of 
transcription, because YB-1 increases chemore^stanoe by en- 
handng the transcription of a nruittkSn^ resistant 



AcUvffOon of Signal nansduction Pathways 

Acdvatton of sfepial transduction pathways by loss of tumor 
suppressor genes or overKcpre^lon of certain tyrosine kinases 
can a)ntrfbLite to the growth and aggressiveness of tumors. An 
important,,mutant in human cancers Is the tumor suppressor 
gene P7BV, wttfch leads to the activation of the PI3K/Akt path- 
way. Activation of PI3K and Akt Induces the oncogenfo trans- 
fbrrnatfon of cHcken embryo fibroblasts, llietransfon^ 
8howconsffiuth^phosphorylatfonolS6Kandor4&^ $1). 
A mutant Akt that retains kinase activity but does not phos- 
phorylate jS6K or 4E-BP1 does not transfonn fibrobtasts, which 
suggests a connelatfon between the oncogenicity of PI3K and 
Akt and the phosphorylation of S6K and 4E-BP1 (B1). 

Several tyrosine kinases such as platelet-derived growth 
factor, Insulin-like growth f^or. HER2/neu, and epidermal 
growth factor receptor are overexpressed in cancer. Be- 
cause these kinases activate downstream signal transduc- 
tk»n pathways known to alter transtatton Inltiatfon. actlvatbn 
of translation Is likely to contribute to the growth and aggres- 
sh^eness of these tumors. Furthenmore, the mRfslA for many 
of these kinases themselves are under translational control. 
For example, HER2/neu mRNA is translatfonaliy controlled 
both by a short upstream open reading frame that represses 
HER2/neu translation In a cell type-Independent manner and 
by a distinct cell type-dependent mechanism that Increases 
translational efficiency (82). HER2/neu translation Is different 
In transfomned and nonnai cells. Thus, it Is possible that 
alteratloris at the translational level can In part account for 
the discrepancy between HERZ/neu gene ampllfteatlon de- 
tected by fluorescence in sftu hybridization and protein levels 
detected by immunohlstochemical assays. 



Translation Targets of Selected Cancer Therapy 
Components of the translation machinery and signal path- 
vvays Involved In the activation of transflatton Initial 
sent good tarigets for cancer therapy. 

TargoOig the mTOR SlgnaOng Pathway: Rapamycbt 
ond Ttonslfttti 

Rapamycfn Inhibits the pnollferatton of lymphocytes. It was 
Initially developed as an immunosuppressive drug for organ 



transplantation. Rapamydn with FKBP 12 (FK506-blnding 
protein, M, 12,000) binds to mTOR to Inhibit Its function. 

Rapamydn causes a amafl but significant reduction In the 
Inltiatton rate of protein synthesis (83), It blocks cell growth in 
part by blocking S6 phosphorylaHw and selectively sup- 
pressing the translalton of 6' TOP mRNAs, such as ribosomal 
proteins, and elongation foctors (83-^89. Rapamydn also 
blocks 4B-BP1 phosphorylation and Inhibits capKiependent 
but not cap-Independent translatfon (17, 86). 

The rapamydn-sensltive signal transductton pathway, acti- 
vated durir^ mangnanf transldnmatkxi and cancer progr^skH% 
is now being studied as a target for cancer therajy ^7). Pros- 
tate, breast, small ceQ lung, gOoblastoma, melanoma, and T-cell 
leukemia ana among the cancer lines most sensitive to the 
rapamydn anak)gue 00-779 (Wyeth-Ayerst Research; Ref. 
87). In ifidbdonriyoosarcdnia cen lines, ra^ 
statte or cytoddal. dependlr^ on the p53 status of the ceH; p53 
wOd^type ceOs treated with fapamydnanest in the phaso 
and nriaintain their viability, whereas p53 mutant cells acci^ 
late In Gi and undergo apoptosls (88, 8$. In a recently reported 
study i^lng human prinrtKive neunoeotodennal tumor and 
medulioblastoma models, rapamydn exhibited more cytotox- 
k% In combination with dsplatin and camptothedn than as a 
single agent fr) vivo, CCI-779 ddayed growth of xenografts by 
160% after 1 week of therapy and 240% after2 weeks. Adngle 
high-doee admlnlstratton caused a 37% decrease In tumor 
vduma Growth IrWtkx) h vivo was 13 times greater, with 
dsplatin In cpmblnatfon with GCI-779 than with dsplatin alone 
PP). Thus, preclinical studies suggest that rapamydn ana- 
\og^ 9e useful as single agents and fai comblnatk)n with 
cherrKJtiierE^sy. 

Rapamydn analogues CCi-779 and RAD001 (Novartis, 
Basel, Switzerland) are now In clinical trials. Because of the 
known effect of rapamydn on lymphocyte prdiferBtk>n, a 
potential problem with rapamydn analogues Is immunosup- 
pression, hlowever, although protonged Immunosuppression 
can result from rapamycin and 001-779 administered on 
continuous-dose schedules, the immunosuppressive effects 
of rapamydn analogues resolve in ^24 h after therapy 
(91). The prindpal toxicities of CCI-779 have Induded der- 
matological toxicity, myelosuppression, Infection, mucositis, 
diantiea, reversible elevations In liver function tests, hyper- 
glycemia, hypokalemia, hypocalcemia, and depression (87, 
92-94). Phase li trials of CCt-779 have been conducted In 
advanced renal cell carcinoma and in stage lll/IV breast 
canDinoma patients who failed with prior chemotherapy. In 
the results reported in abstract fornix although there were no 
complete responses, partial responses were documented In 
tx>th renal cell carcinoma and In breast cardnoma ^4, 95). 
Thus, CCI-779 has documented preliminary dinlcal activity in 
a previously treated, unsdected patient population. 

Active investigatton Is under way Into patient selection tor 
mTOR InhibitorB. Several studies have found an enhanced 
efficacy of CCI-779 in PTEN-null tumors (30, 96). Another 
study found that ^ of eight breast cancer cdl lines were 
responsive to CCI-779, although only two of these Ones 
lacked PHHEN l!97) There was. however, a positive conrdatlon 
between Akt acthratlon and CCI-779 sensitivity (97). This 
conelatlon suggests that activation of the PI3K-Akt pathway, 
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regardless of whether It is attributable to a PTEN mutation or 
to overexpression of receptor tyrosine idnases, makes can- 
cer cell amenable to mTOR*^lrected therapy. In contrast, 
lower levels of the target of mTOR, 4E-BP1. are associated 
with rapam/dn resistance; thus, a lower 4E-BP1/elF4E ratio 
nfiay predict rapamycln resistance (98). 

Another mode of activity for rapamydn and Its analogues 
appears to be through inhibition of anglogenesis. This activ- 
ity may be both through direct inhibition of endothelial cell 
proliferBtion as a result of mTOR inhibition in these cells or by 
inhibition of translation of such proangiogenic factors as 
vascular endothelial growth factor in tumor cells (99, 1 00). 

The angiogenesis inhibitor tumstatin, another anticancer 
drug cumentty undier study, was also found recently to inhibit 
translation in endothelial ceils (101). Through a requisite in- 
teraction with integpin, tumstatin Inhibits activation of the 
PI3K;Akt pathway and mTOR In endothelial cells and pre- 
vents dissodatlon of €lF4E from 4E-BP1. thereby inhibiting 
cap-dependent translation. These findings suggest that en- 
dothelial cells are especially sensitive to therapies targeting 
the mTOR-algnalIng pathway. 



ment also reduces the expression of angiogenic factors (116) 
and has been proposed as a potential adjuvant therapy for head 
and neck cancers, particularly vvhen elevated eIRE is 
surgical margins. Small molecule hhibttors that bind the eIF4<V 
4E-BP1-blndlng domain of elF4E are proapoptotic (116) and 
are idso being actively pursued. 



Exploiting Selective T)ranslatlon for Gene Therapy 
A diffmnt th^apeutlc approach that takes advantage of the 
enhanced cap-dependent translation in cancer cells is the use 
of gene therapy vectors encoding suidde genes with highly 
structured 5' UTFlTheseniRNA would thus be ataoompet^ 
disadvantage in normal cells and not translate well, whereas In 
cancer ceils, th^wouM translate mora efficiently. FbrQ}(ample, 
the introductkm of the 5' irm of fbroblast growth 
the coding sequence of herpes sbnpfex vfn^ type- 1 tfiymfdino 
Miase gene, allows for selecth/e trandatlcffi of /)ef^ 
vhJB type-l thymidine kinase gene In breast cancer cell lines 
compared with normal nnammaiy cefl lines and results in se- 
lective sensitivHy to ganciclovir (117). 



TargeOfg elF2€e £PA CfoMmazole^ mda-?, 
endHamnoUe 

EPA Is an n-3 polyunsaturated fatty acid found in the fish- 
based diets of populations having a k)w Inddence of cancer 
(102). EPA inhibits the proliferatkm of cancer cells (103), as 
well as In animal models (104, 105). It blocks cell division by 
inhibiting translation initiation (105). EPA releases Ca^^ from 
Intracellular stores while Inhibiting their refilling, thereby ac- 
tivating PKa PKR, In turn phosphorylates and inhibits elF2a, 
resulting In the Inhibition of protein synthesis at the level of 
translaUon initiation. Similarly, clotrimazole, a potent antipro- 
liferalive agent !n vitro and In vfvo, inhibits cell growth through 
depletion of Ca^"" stores, activation of PKR, and phospho- 
rylation of elF2a (106). Consequently, clotrima2X)le prsferen- 
tially decreases the expression of cyclins A, E. and D1, 
resulting In blockage of the cell cycle in Q^. 

mda-7 Is a novel tunrKW suppressor gene being developed 
as a gene therapy agent. Adenoviral transfer of mda-7 (Ad- 
mdaT) induces apoptosis In many cancer cells including 
breast, colorectal, and lung cancer (1 07-1 09). Ad-mda7 also 
Induces and activates PKR. which leads to phosphorylation 
of eiF2a and Induction of apoptosis (110), 

Flavonoids such as genlstein and quercetin suppress tu* 
nrtor cell growth. Ail three mammalian eiF2o kinases, PKR. 
heme-regulated Inhibitor, and PERK/PEK, are activated by 
flavonoids. with phosphorylaHon of elF2o and Inhibition of 
protein synthesis (111). 



Targeting elF4A and elF4E: Antbenee RNA 
end Peptides 

Antisense expression of elF4A decreases the prorrferatlon rate 
of nielanoma cells (1 12). Sequestration of elF^ by overexpres- 
sion of 4E-BP1 is proapoptotic and decraasea tumortgenlcity 
(113, 114). Reduction of elF4E wfth antisense mA decreases 
soft agar growth, increases tumor latency, and increases the 
rates of tumor doubling times (7), Antisense elF4E RNA treat- 



Toward the Future 

Translation is a cmdai process in every cell. However, several 
alterations In translaftional oontml occur In c 
appear to need an aberrantly activated transiatkmal state for 
siHVival, thus ^towing the targeting of translatton Initiation wfth 
surprisingly low toxfclty. Components of the translatlonal nna- 
chlnery. such as elF4E, and signal transduction pathway bv 
volved In translatkm Initiatfon, such mTDI=<. represent promisfng 
targets for cancer therapy. Inhibitors of the mTOR have already 
shown some preliminary activity In clinical trials. It Is possible 
that with the development of better predicth^ markers and 
beXi&r patient selection, response rates to single-agent therapy 
can be improved. Similar to other cytostatic agents, however, 
mTOR inhibitors are most likely to acNevie dinlcal utitfty In 
comblna[tion therapy. In the interim, our increasing understand- 
ing of translation initiation and signal transduction pathways 
promise to lead to the identifteation of new therapeutic targets 
In the near future. 
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Correlation between Protein and mRNA Abundance in Yeast 
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We have determined the relationship between mRNA and protein expression levels for selected genes 
expressed in the yeast Saccharomyces cerevisiae growing at mid-log phase. The proteins contained in total yeast 
cell lysate were separated by high-resolution two-dimensional (2D) gel electrophoresis. Over 150 protein spots 
were excised and identified by capillary liquid chromatography-tandem mass spectrometry (LC-MS/MS). 
Protein spots were quantified by metabolic labeling and scintillation counting. Corresponding mRNA levels 
were calculated from serial analysis of gene expression (SAGE) frequency tables (V. E. Velculescu, L. Zhang, 
W. Zhou, J. Vogelstein, M. A. Basrai, D. E. Bassett, Jr., P. Hieter, B. Vogelstein, and K, W. Kinzler, Cell 
88:243-251, 1997). We found that the correlation between mRNA and protein levels was insufficient to predict 
protein expression levels from quantitative mRNA data. Indeed, for some genes, while the mRNA levels were 
of the same value the protein levels varied by more than 20-fold. Conversely, invariant steady-state levels of 
certain proteins were observed with respective mRNA transcript levels that varied by as much as 30-fold. 
Another interesting observation is that codon bias is not a predictor of either protein or mRNA levels. Our 
results clearly delineate the technical boundaries of current approaches for. quantitative analysis of protein 
expression and reveal that simple deduction from mRNA transcript analysis is insufficient 



The description of the state of a biological system by the 
quantitative measurement of the system constituents is an es- 
sential but largely unexplored area of biology. With recent 
technical advances including the development of differential 
display-PCR (21), of cDNA microarray and DNA chip tech- 
nology (20, 27), and of serial analysis of gene expression 
(SAGE) (34, 35), it is now feasible to establish global and 
quantitative mRNA expression profiles of cells and tissues in 
species for which the sequence of all the genes is known. 
However, there is emerging evidence which suggests that 
mRNA expression patterns are necessary but are by them- 
selves insufficient for the quantitative description of biological 
systems. This evidence includes discoveries of posttranscrip- 
tional mechanisms controlling the protein translation rate (15), 
the half-lives of specific proteins or mRNAs (33), and the 
intracellular location and molecular association of tlie protein 
products of expressed genes (32). 

Proteome analysis, defined as the analysis of the protein 
complement expressed by a genome (26), has been suggested 
as an approach to the quantitative description of the state of a 
biological system by the quantitative analysis of protein expres- 
sion profiles (36). Proteome analysis is conceptually attractive 
because of its potential to determine properties of biological 
systems that are not apparent by DNA or mRNA sequence 
analysis alone. Such properties include the quantity of protein 
expression, the subcellular location, the state of modification, 
and the association with ligands, as well as the rate of change 
with time of such properties. In contrast to the genomes of a 
number of microorganisms (for a review, see reference 11) and 
the transcriptome of Saccharomyces cerevisiae (35), which have 
been entirely determined, ho proteome map has been com- 
pleted to date. 

The most common implementation of proteome analysis is 
the combination of two-dimensional gel electrophoresis (2DE) 
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(isoelectric focusing^sodium dodecyl sulfate [SDS]-polyacryl- 
amide gel electrophoresis) for the separation and quantitation 
of proteins with analytical methods for their identification. 
2DE permits the separation, visualization, and quantitation of 
thousands of proteins reproducibly on a single gel (18, 24). By 
itself, 2DE is strictly a descriptive technique. The combination 
of 2DE with protein analytical techniques has added the pos- 
sibility of establishing the identities of separated proteins (1, 2) 
and thus, in combination with quantitative mRNA analysis, of 
correlating quantitative protein and mRNA expression mea- 
surements of selected genes. 

The recent introduction of mass spectrometric protein anal- 
ysis techniques has dramatically enhanced the throughput and 
sensitivity of protein identification to a level which now permits 
the large-scale analysis of proteins separated by 2DE. The 
techniques have reached a level of sensitivity that permits the 
identification of essentially any protein that is detectable in the 
gels by conventional protein staining (9, 29). Current protein 
analytical technology is based on the mass spectrometric gen- 
eration of peptide fragment patterns that are idiotypic for the 
sequence of a protein. Protein identity is established by corre- 
lating such fragment patterns with sequence databases (10, 22, 
37). Sophisticated computer software (8) has automated the 
entire process such that proteins are routinely identified with 
no human interpretation of peptide fragment patterns. 

In this study, we have analyzed the mRNA and protem levels 
of a group of genes expressed in exponentially growing cells of 
the yeast 5. cerevisiae. Protein expression levels were quantified 
by metabolic labeling of the yeast proteins to a steady state, 
followed by 2DE and liquid scintillation counting of the se- 
lected, separated protein species. Separated proteins were 
identified by in-gel tryptic digestion of spots with subsequent 
analysis by microspray liquid chromatography-tandem mass 
spectrometry (LC-MS/MS) and sequence database searching. 
The corresponding mRNA transcript levels were calculated 
from SAGE frequency tables (35). 

This study, for the first time, explores a quantitative com- 
parison of mRNA transcript and protein expression levels for 
a relatively large number of genes e?q)ressed in the same met- 
abolic state. The resultant correlation is insufficient for predic- 
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FIG. 1. Schematic illustration of proteome analysis by 2DE and mass spectrometiy. In part I, proteins are separated by 2DE, stained spots are excised and subjected 
to in-gel digestion with trypsin, and the resulting peptides are separated by on-line capillary hi^-performance liquid chromatography. In part II, a peptide is shown 
cluting from the column in part I. The peptide is ionized by electrospray ionization and enters the mass spectrometer. The mass of the ionized peptide is detected, and 
the first quadrupole mass filter allows only the specific mass-to-chargc ratio of the selected peptide ion to pass into the collision cell. In the collision cell, the energized, 
ionized peptides collide with neutral argon gas molecules. Fragmentation of the peptide is essentially random but occurs mainly at the peptide bonds, resulting in smaller 
peptides of dififering lengths (masses). These peptide fragments are detected as a tandem mass (MS/MS) spectrum in the third quadrupole mass filter where two ion 
series are recorded simultaneously, one each from sequencing inward from the N and C termini of the peptide, respectively. In part III, the MS/MS spectrum from the 
selected, ionized peptide is compared to predicted tandem mass spectra computer generated from a sequence database. Provided that the peptide sequence exists in 
the database, the peptide and, by association, the protein from which the peptide was derived can be identified. Unambiguous protein identification is attained in a single 
analysis because multiple peptides are identified as being derived from the same protein. 



tion of protein levels from mRNA transcript levels. We have 
also compared the relative amounts of protein and mRNA 
with the respective codon bias values for the corresponding 
genes. This comparison indicates that codon bias by itself is 
insufficient to accurately predict either the mRNA or the pro- 
tein expression levels of a gene. In addition, the results dem- 
onstrate that only highly expressed proteins are detectable by 
2DE separation of total cell lysates and that therefore the 
construction of complete proteome maps with current technol- 
ogy will be veiy challenging, irrespective of the type of organ- 
ism. 

MATERIALS AND METHODS 

Yeast strain and growth conditions. The source of protein and message tran- 
scripts for all experiments was YPH499 (MATb uraS-Sl fys2-801 adel-lOl 
Ieu2-M his3-^200 trpl-MS) (30). Logarithmically growing cells were obtained by 
growing yeast cells to early log phase (3 X 10* cells/ml) in YPD rich medium 
(YPD supplemented with 6 mM uracil, 4.8 mM adenine, and 24 mM tiyptophan) 
at 3(fC {35), Metabolic labeling of protein was accomplished in YPD medium 



exactly as described elsewhere (4) with the exception that 1 ml of cells was 
labeled with 3 mCi to ofi^t methionine present in YPD medium. Protein was 
harvested as described by Garrels and coworkers (12). Harvested protein was 
lyophilized, resuspended in isoelectric focusing gel rehydration solution, and 
stored at -80*C 

2DE. Soluble proteins were run in the first dimension by using a commercial 
flatbed electrophoresis system (Multiphor II; Pharmacia Biotech). Immobilized 
polyacrylamide gel (IPG) diy strips with nonlinear pH 3,0 to 10.0 gradients 
(Amersham-Pharmacia Biotrah) were used for the first-dimension separation. 
Forty micrograms of protein from whole-cell lysates was mked with IPG strip 
rehydration buffer (8 M urea, 2% Nonidet P-40, 10 mM dithiothreitol), and 250 
to 380 \l\ of solution was added to individual lanes of an IPG strip rehydration 
tray (Amersham-Pharmacia Biotech). The strips were allowed to rehydrate at 
room temperature for 1 h. The samples were run at 300 V-10 mA-5 W for 2 h, 
then ramped to 3,500 V-10 mA-5 W over a period of 3 h, and then kept at 3,500 
V-10 mA-5 W for 15 to 19 h. At the end of the first-dimension run (60 to 70 kV • 
h), the IPG strips were reequilibrated for 8 min in 2% (wt/vol) dithiothreitol in 
2% (wl^ol) SDS-6 M urea-30% (wt/vol) glycerol-0.05 M Tris HQ (pH 6.8) and 
for 4 min in 2.5% iodoacetamide in 2% (wt/vol) SDS-6 M uiear-30% (wt/vol) 
glycerol-0.05 M Tris HCl (pH 6.8). Following reequilibration, the strips were 
transferred and apposed to 10% polyacrylamide second-dimension gels. Poly- 
acrylamide gels were poured in a casting stand with 10% acrylamid&>2.67% 
piperazine diaciylamide-0375 M Tm base-HQ (pH 8^.1% (wtArol) SDS-a05% 
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FIG. 2. 2D silver-stained gel of the proteins in yeast total cell lysate. Proteins were separated in the first dimension (horizontal) by isoelectric focusing and then in 
the second dimension (vertical) by molecular weight sieving. Protein spots (156) were chosen to include the entire range of molecular weights, isoelectric focusing points, 
and staining intensities. Spots were excised, and the corresponding protein was identified by mass spectrometiy and database searching. The spots are labeled on the 
gel and correspond to the data presented in Table 1. Molecular weights are given in thousands. 



(wtA'ol) ammonium persulfate-0.05% TEMED j(yV^^'//'-tetramethyIethyl- 
enediamine) in Milli-Q water. The apparatus used to run second-dimension gels 
was a noncommercial apparatus from Oxford Glycosciences, Inc. Once the IPG 
strips were apposed to the second-dimension gels, they were immediately run at 
50 mA (constanl)-500 V-85 W for 20 min, followed by 200 mA (constant)-500 
V-85 W until the buffer front line was 10 to 15 mm from the bottom of the gel. 
Gels were removed and silver stained according to the procedure of Shevchenko 
et al. (29). 

Protein identification. Gels were exposed to X-ray film overnight, and then the 
silver staining and film were used to excise 156 spots of varying intensities, 
molecular weights, and isoelectric focusing points. In order to increase the 
detection limit by mass spectrometry, spots were cut out and pooled from up to 
four identical cold, silver-stained gels. In-gel tryptic digests of pooled spots were 
performed as described previously (29). Tiyptic peptides were analyzed by mi- 
crocapillary LC-MS with automated switching to MS/MS mode for peptide 
fragmentation. Spectra were searched against the composite OWL protein se- 
quence database (version 30.2; 250,514 protein sequences) (24a) by using the 
computer program Sequest (8), which matches theoretical and acquired tandem 
mass spectra. A protein match was determined by comparing the number of 
peptides identified and their respective cross-correlation scores. All protein 
identifications were verified by comparison with theoretical molecular weights 
and isoelectric points. 



mRNA quantitation. Velculescu and coworkers have previously generated 
frequency tables for yeast mRNA transcripts from the same strain grown under 
the same stated conditions as described herein (35). The SAGE technology is 
based on two main principles. First, a short sequence tag (15 bp) that contains 
sufficient information uniquely to identify a transcript is generated. A single tag 
is usually generated from each mRNA transcript in the cell which corresponds to 
15 bp at the 3 '-most cutting site for NZalll. Second, many transcript tags can be 
concatenated into a single molecule and then sequenced, revealing the identiQr of 
multiple tags simultaneously. Over 20,000 transcripts were sequenced from yeast 
strain YPH499 growing at mid>log phase on glucose. Assuming the previously 
derived estimate of 15,000 mRNA molecules per cell (16), this would represent 
a 1.3-fold coverage even for mRNA molecules present at a single copy per cell 
and would provide a 72% probability of detecting such transcripts. Computer 
software which took for input the gene detected, examined the nucleotide se- 
quence, and performed the calculation as described by Velculescu and cowoikcrs 
(35) was written. In practice, we found that for 21 of 128 (16%) genes examined 
viable mRNA levels from SAGE data could not be calculated. This was because 
(i) no CATG site was found in the open reading frame (ORF), (ii) a CATG site 
was found but the corresponding 10-bp putative SAGE tag was not found in the 
frequency tables, or (iii) identical putative SAGE tags were present for multiple 
genes (eg.. TOH2_YEAST and TOH3_YEAST). 
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TABLE 1. Expressed genes identified from 2D gel in Fig. 2 TABLE l-^Continued 
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2.8 


3.7 
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75,396 


5.82 


6 


GRSl 


5.5 


7.4 


0.500 


85,720 


6.25 


1 


MET6 


2.0 


NA*^ 
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85.720 


6.25 


2 


MET6 


10.9 


NA' 


0.772 


85.720 


6.25 


3 


MET6 


1.4 


NA*^ 


0.772 


93,276 


6.11 


131 


EFTl 


17,9 


41.6 


0.890 


93,276 
102,064* 


6.11 


132 


EFTl 


5.7 


41.6 


0.890 


6.61* 


94 


ADE3 


4.8 


5.2 


0.423 


107,482* 


5,33^^ 


4 


MCM3 


2.7 


NA^ 


0,240 



" YPD gene names are available from the YPD website (39). 

NA, calculation could not be performed or was not available, 

mRNA data inconclusive or NA. 
^ No methionines in predicted ORF; therefore, protein concentration was not 
determined. 

' Measured molecular weight or pi did not match theoretical molecular weight 
or pi. 



Protein quantitation. [^^S]methionine-labeled gels were exposed to X-ray film 
overnight, and then the silver stain and film were used to excise 156 spots of 
varying intensities, molecular weights, and pis. The excised spots were placed in 
0.6-ml microcentrifuge tubes, and scintillation cocktail (100 \iX) was added. The 
samples were vortexed and counted. In addition, two parallel gels were electro- 
blotted to polyvinylidene difluoride membranes. The membranes were exposed 
to X-ray film, and four intense single spots were excised from each membrane 
and subjected to amino acid analysis. For these four spots, a mean of 209 ± 4 
cpm/pmol of protein/methionine was found. This number was used to quantitate 
all remaining spots in conjunction with the number of methionines present in the 
protein. 

To ensure that proteins were labeled to equilibrium, parallel 2D gels were 
prepared and run on yeast metabolically labeled for 1, 2, 6, or 18 h. The 
corresponding 156 spots were excised from each get, and radioactivity was mea- 
sured by liquid scintillation counting for each spot. Calculated protein levels were 
highly reproducible for all time points measured after 1 h. 

Calculation of codon bias and predicted half-life. Codon bias values were 
extracted from the YPD spreadsheet (17). Protein half-lives were calculated 
based on the N-end rule (33). When the N-terminal processing was not known 
experimentally, it was predicted based on the affinity of methionine aminopep> 
tidase (31). 

RESULTS 

Characteristics of proteome approach. Nearly every facet of 
proteome analysis hinges on the unambiguous identification of 
large numbers of expressed proteins in cells. Several tech- 
niques have been described previously for the identification of 
proteins separated by 2DE, including N-terminal and internal 
sequencing (1, 2), amino acid analysis (38), and more recently 
mass spectrometry (25). We utilized techniques based on mass 
spectrometry because they afford the highest levels of sensitiv- 
ity and provide unambiguous identification. The specific pro- 
cedure used is schematically illustrated in Fig. 1 and is based 
on three principles. First, proteins are removed from the gel by 



proteolytic in-gel digestion, and the resulting peptides are sep- 
arated by on-line capillary high-performance liquid chromatog- 
raphy. Second, the eluting peptides are ionized and detected, and 
the specific peptide ions are selected and fragmented by the 
mass spectrometer. To achieve this, the mass spectrometer 
switches between the MS mode (for peptide mass identifica- 
tion) and the MS/MS mode (for peptide characterization and 
sequencing). Selected peptides are fragmented by a process 
called collision-induced dissociation (CID) to generate a tan- 
dem mass spectrum (MS/MS spectrum) that contains the pep- 
tide sequence information. Third, individual CID mass spectra 
are then compared by computer algorithms to predicted spec- 
tra from a sequence database. This results in the identification 
of the peptide and, by association, the protein(s) in the spot. 
Unambiguous protein identification is attained in a single anal- 
ysis by the detection of multiple peptides derived from the 
same protein. 

Protein identification. Yeast total cell protein lysate (40 ^jig), 
metabolically labeled with p^S]methionine, was electro- 
phoretically separated by isoelectric focusing in the first dimen- 
sion and by SDS-10% polyacrylamide gel electrophoresis in 
the second dimension. Proteins were visualized by silver stain- 
ing and by autoradiography. Of the more than 1,000 proteins 
visible by silver staining, 156 spots were excised from the gel 
and subjected to in-gel tryptic digestion, and the resulting 
peptides were analyzed and identified by microspray LC- 
MS/MS techniques as described above. The proteins in this 
study were all identified automatically by computer software 
with no human interpretation of mass spectra. They are indi- 
cated in Fig. 2 and detailed in Table 1. 

The CID spectra shown in Fig. 3 indicate that the quality of 
the identification data generated was suitable for unambiguous 
protein identification. The spectra represent the amino acid 
sequences of tryptic peptides NSGDIVNLGSIAGR (Fig. 3A) 
and FAVGAFTDSLR (Fig. 3B). Both peptides were derived 
from protein S57593 (hypothetical protein YMR226C), which 
migrated to spot 114 (molecular weight, 29,156; pi, 6.59) in the 
2D gel in Fig. 2. Five other peptides from the same analysis 
were also computer matched to the same protein sequence. 

Protein and mRNA quantitation. For the 156 genes investi- 
gated, the protein expression levels ranged from 2,200 (PGM2) 
to 863,000 (TDH2AT)H3) copies/cell. The levels of mRNA for 
each of the genes identified were calculated from SAGE fre- 
quency tables (35). These tables contain the mRNA levels for 
4,665 genes in yeast strain YPH499 grown to mid-log phase in 
YPD medium on glucose as a carbon source. In some in- 
stances, the mRNA levels could not be calculated for reasons 
stated in Materials and Methods. For the proteins analyzed in 
this study, mean transcript levels varied from 0.7 to 473 copies/ 
cell. 

Selection of the sample population for mRNA-protein ex- 
pression level correlation. The protein spots selected for iden- 
tification were selected from spots visible by silver staining in 
the 2D gel. An attempt was made not to include spots where 
overlap with other spots was readily apparent. The number of 
proteins identified was 156 (Table 1). Some proteins migrated 
to more than one spot (presumably due to differential protein 
processing or modifications), and protein levels from these 
spots were calculated by integrating the intensities of the dif- 
ferent spots. The 156 protein spots analyzed represented the 
products of 128 different genes. Genes were excluded from the 
correlation analysis only if part of the data set was missing; i.e., 
genes were excluded if (i) no mRNA expression data were 
available for the protein or putative SAGE tags were ambig- 
uous, (ii) the amino acid sequence did not contain methionine, 
(iii) more than a single protein was conclusively identified as 
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FIG. 3. Tandem mass (MS/MS) spectra resulting from analysis of a single spot on a 2D gel. The first quadrupole selected a single mass-to-chargc ratio (m/z) of 687.2 
(A) or 592.6 (B), while the collision cell was filled with argon gas, and a voltage which caused the peptide to undergo fragmentation by CID was applied. The third 
quadrupole scanned the mass range from 50 to 1,400 m/z. The computer program Sequest (8) was utilized to match MS/MS spectra to amino acid sequence by database 
searching. Both spectra matched peptides from the same protein, SS7593 (yeast hypothetical protein YMR226C). F'we other peptides from the same analysis were 
matched to the same protein. 



migrating to the same gel spot, or (iv) the theoretical and 
observed pis and molecular weights could not be reconciled. 
After these criteria were applied, the number of genes used in 
the correlation analysis was 106. 



Codon bias and predicted half-lives. Codon bias is thought 
to be an indicator of protein expression, with highly expressed 
protems having large codon bias values. The codon bias distri- 
bution for the entire set of more than 6,000 predicted yeast 
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gene ORFs is presented in Fig. 4A, The interval with the 
largest frequency of genes is between the codon bias values of 
0.0 and 0.1. This segment contains more than 2,500 genes. The 
distribution of the codon bias values of the 128 different genes 
found in this study (all protein spots from Fig. 2) is shown in 
Fig. 4B, and protein half-lives (predicted from applying the 
N-end rule [33] to the experimentally determined or predicted 
protein N termini) are shown in Fig. 4C. No genes were iden- 
tified with codon bias values less than 0.1 even though thou- 
sands of genes exist in this category. In addition, nearly all of 
the proteins identified had long predicted half-lives (greater 
than 30 h). 

Correlation of mRNA and protein expression levels. The 
correlation between mRNA and protein levels of the genes 
selected as described above is shown in Fig. 5. For the entire 
group (106 genes) for which a complete data set was gener- 
ated, there was a general trend of increased protein levels 
resulting from increased mRNA levels. The Pearson product 
moment correlation coefficient for the whole data set (106 
genes) was 0.935. This number is highly biased by a small 
number of genes with very large protein and message levels. A 
more representative subset of the data is shown in the inset of 
Fig. 5. It shows genes for which the message level was below 10 
copies/cell and includes 69% (73 of 106 genes) of the data used 
in the study. The Pearson product moment correlation coeffi- 
cient for this data set was only 0.356. We also found that levels 
of protein expression coded for by mRNA with comparable 
abundance varied by as much as 30-fold and that the mRNA 
levels coding for proteins with comparable expression levels 
varied by as much as 20-fold. 

The distortion of the correlation value induced by the un- 
even distribution of the data points along the x axis is further 
demonstrated by the analysis in Fig. 6. The 106 samples in- 
cluded in the study were ranked by protein abundance, and the 
Pearson product moment correlation coefficient was repeat- 
edly calculated after including progressively more, and higher- 
abundance, proteins in each calculation. The correlation values 
remained relatively stable in the range of 0.1 to 0.4 if the 
lowest-expressed 40 to 95 proteins used in this study were 
included. However, the correlation value steadily climbed by 
the inclusion of each of the 11 very highly expressed proteins. 

Correlation of protein and mRNA expression levels with 
codon bias. Codon bias is the propensity for a gene to utilize 
the same codon to encode an amino acid even though other 
codons would insert the identical amino acid iii the growing 
polypeptide sequence. It is further thought that highly ex- 
pressed proteins have large codon biases (3). To assess the 
value of codon bias for predicting mRNA and protein levels in 
exponentially growing yeast cells, we plotted the two experi- 
mental sets of data versus the codon bias (Fig. 7). The distri- 
bution patterns for both mRNA and protein levels with respect 
to codon bias were highly similar. There was high variability in 
the data within the codon bias range of 0,8 to 1.0. Although a 
large codon bias generally resulted in higher protein and mes- 
sage expression levels, codon bias did not appear to be predic- 
tive of either protein levels or mRNA levels in the cell. 

DISCUSSION 

The desired end point for the description of a biological 
system is not the analysis of mRNA transcript levels alone but 
also the accurate measurement of protein expression levels and 
their respective activities. Quantitative analysis of global 
mRNA levels currently is a preferred method for the analysis 
of the state of cells and tissues (11). Several methods which 
either provide absolute mRNA abundance (34, 35) or relative 
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FIG. 4. Current proteome analysis technology utilizing 2DE without preen- 
richment samples mainly highly expressed and long-lived proteins. Genes encod- 
ing highly expressed proteins generally have large codon bias values. (A) Distri* 
bution of the yeast genome (more than 6,000 genes) based on codon bias. The 
inteival with the largest frequency of genes is 0.0 to 0.1, with more than 2,500 
genes, (B) Distribution of the genes from identified proteins in this study based 
on codon bias. No genes with codon bias values less than 0.1 were detected in this 
study. (C) Distribution of identified proteins in this study based on predicted 
half-life (estimated by N-end rule). 



mRNA levels in comparative analyses (20, 27) have been de- 
scribed elsewhere. The techniques are fast and exquisitely sen- 
sitive and can provide mRNA abundance for potentially any 
expressed gene. Measured mRNA levels are often unplicitly or 
explicitly extrapolated to indicate the levels of activity of the 
corresponding protein in the cell. Quantitative analysis of pro- 
tein expression levels (proteome analysis) is much more time- 
consuming because protems are analyzed sequentially one by 
one and is not general because anadyses are limited to the 
relatively highly expressed proteins. Proteome analysis does, 
however, provide types of data that are of critical unportance 
for the description of the state of a biological system and that 
are not readily apparent from the sequence and the level of 
expression of the mRNA transcript. This study attempts to 
examine the relationship between mRNA and protein expres- 
sion levels for a large number of expressed genes in cells 
representing the same state. 

Limits in the sensitivity of current protein analysis technol- 
ogy precluded a completely random sampling of yeast proteins. 
We therefore based the study on those proteins visible by sihrer 



VOL 19, 1999 



CORRELATION BETWEEN PROTEIN AND mRNA LEVELS IN YEAST 1727 



1000 -i 



S 750 - 



100 



a 
o 
o 



c 
c 

3 

JQ 
CO 

£ 

1 

Q. 



250 - 




10,0 




50 



100 



150 



200 



250 



300 



mRNA abundance (copies/cell) 



FIG. 5. Correlation between protein and mRNA levels for 106 genes in yeast growing at log phase with glucose as a carbon source. mRNA and protein levels were 
calculated as described in Materials and Methods. The data represent a population of genes with protein expression levels visible by silver staining on a 2D gel chosen 
to include the entire range of molecular weights, isoelectric focusing points, and staining intensities. The inset shows the low-end portion of the main figure. It contains 
69% of the original data set. The Pearson product moment correlation for the entire data set was 0.935. The correlation for the inset containing 73 proteins (69%) was 
only 0356. 



Staining on a 2D gel. Of the more than 1,000 visible spots, 156 
were chosen to include the entire range of molecular weights, 
isoelectric focusing points, and staining intensities displayed on 
the 2D protein pattern. The genes identified in this study 
shared a number of properties. First, all of the proteins in this 
study had a codon bias of greater than 0.1 and 93% were 
greater than 0.2 (Fig. 4B). Second, with few exceptions, the 
proteins in this study had long predicted half-lives according to 
the N-end rule (Fig. 4C). Third, low-abundance proteins with 
regulatory functions such as transcription factors or protein 
kinases were not identified. 

Because the population of proteins used in this study ap- 
pears to be fairly homogeneous with respect to predicted half- 
life and codon bias, it might be expected that the correlation of 
the mRNA and protein expression levels would be stronger for 
this population than for a random sample of yeast proteins. We 
tested this assumption by evaluating the correlation value if 
different subsets of the available data were included in the 
calculation. The 106 proteins were ranked from lowest to high- 
est protein expression level, and the trend in the correlation 
value was evaluated by progressively including more of the 
higher-abundance proteins in the calculation (Fig. 6). The cor- 
relation value when only the lower-abundance 40 to 93 pro- 
teins were examined was consistently between 0.1 and 0.4. If 
the 11 most abundant proteins were included, the correlation 
steadily increased to 0.94. We therefore expect that the corre- 
lation for all yeast proteins or for a random selection would be 
less than 0.4. The observed level of correlation between 
mRNA and protein expression levels suggests the importance 



of posttranslational mechanisms controlling gene expression. 
Such mechanisms include translational control (15) and con- 
trol of protein half-life (33). Since these mechanisms are also 
active in higher eukaryotic cells, we speculate that there is no 
predictive correlation between steady-state levels of mRNA 
and those of protein in mammalian cells. 

Like other large-scale analyses, the present study has several 
potential sources of error related to the methods used to de- 
termine mRNA and protein expression levels. The mRNA 
levels were calculated from frequency tables of SAGE data. 
This method is highly quantitative because it is based on actual 
sequencing of unique tags from each gene, and the number of 
times that a tag is represented is proportional to the number of 
mRNA molecules for a specific gene. This method has some 
limitations including the following: (i) the magnitude of the 
error in the measurement of mRNA levels is inversely propor- 
tional to the mRNA levels, (ii) SAGE tags from highly similar 
genes may not be distinguished and therefore are summed, (iii) 
some SAGE tags are from sequences in the 3' untranslated 
region of the transcript, (iv) incomplete cleavage at the SAGE 
tag site by the restriction enzyme can result in two tags repre- 
senting one mRNA, and (v) some transcripts actually do not 
generate a SAGE tag (34, 35). 

For the SAGE method, the error associated with a value 
increases with a decreasing number of transcripts per cell. The 
conclusions drawn from this study are dependent on the qual- 
ity of the mRNA levels from previously published data (35). 
Since more than 65% of the mRNA levels included in this 
study were calculated to 10 copies/cell or less (40% were less 
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FIG. 6. Effect of highly abundant proteins on Pearson product moment correlation coefficient for mRNA and protein abundance in yeast. The set of 106 genes was 
ranked according to protein abundance, and the correlation value was calculated by including the 40 lowest-abundance genes and then progreissively induding the 
remaining 66 genes in order of abundance. The correlation value climbs as the final 11 highly abundant proteins are included. 



.than 4 copies/cell), the error associated with these values may 
be quite large. The mRNA levels were calculated from more 
than 20,000 transcripts. Assuming that the estimate of 15,000 
mRNA molecules per cell is correct (16), this would mean that 
mRNA transcripts present at only a single copy per cell would 
be detected 72% of the time (35). The mRNA levels for each 
gene were carefully scrutinized, and only mRNA levels for 
which a high degree of confidence existed were included in the 
correlation value. 

Protein abundance was determined by metabolic radiolabel- 
. ing with [^^S]methionine. The calculation required knowledge 
of three variables: the number of methionines in the mature 
protein, the radioactivity contained in the protein, and the 
specific activity of the radiolabel normalized per methionine. 
The number of methionines per protein was determined from 
the amino acid sequence of the proteins identified by tandem 
mass spectrometry. For some proteins, it was not known 
whether the methionine of the nascent polypeptide was pro- 
cessed away. The N termini of those proteins were predicted 
based on the specificity of methionine aminopeptidase (31). If 
the N-terminal processing did not conform to the predicted 
specificity of processing enzymes, the calculation of the num- 
ber of methionines would be affected. This discrepancy would 
affect most the quantitation of a protein with a very low num- 
ber of methionines. The average number of calculated methi- 
onines per protein in this study was 7.2. We therefore expect 
the potential for erroneous protein quantitation due to un- 
usual N-terminal processing to be small. 



The amount of radioactivity contained in a single spot might 
be the sum of the radioactivity of comigrating proteins. Be- 
cause protein identification was based on tandem mass spec- 
trometric techniques, comigrating proteins could be identified. 
However, comigrating proteins were rarely detected in this 
study, most likely because relatively small amounts of total 
protein (40 fJig) were initially loaded onto the gels, which re- 
sulted in highly focused spots containing generally 1 to 25 ng of 
protein. Because of the relatively small amount loaded, the 
concentrations of any potentially comigrating protein would 
likely be below the limit of detection of the mass spectrometry 
technique used in this study (1 to 5 ng) and below the limit of 
visualization by silver staining (1 to 5 ng). In the overwhelming 
majority of the samples analyzed, numerous peptides from a 
single protein were detected. It is assumed that any comigrat- 
ing proteins were at levels too low to be detected and that their 
influence in the calculation would be small. 

The specific activity of the radiolabel was determined by 
relating the precise amount of protein present in selected spots 
of a parallel gel, as determined by quantitative amino acid 
composition analysis, to the number of methionines present in 
the sequence of those proteins and the radioactivity deter- 
mined by liquid scintillation counting. It is possible that the 
resulting number might be influenced by unavoidable losses 
inherent in the amino acid analysis procedure applied. Because 
four different proteins were utilized in the calculation and the 
experiment was done in duplicate, the specific activity calcu- 
lated is thought to be highly accurate. Indeed, the specific 
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FIG. 7. Relationship between codon bias and protein and mRNA levels in this study. Yeast mRNA and protein expression levels were calculated as described in 
Materials and Methods. The data represent the same 106 genes as in Fig. 5. 



activities calculated for each of the four proteins varied by less 
than 10%. Any inconsistencies in the calculation of the specific 
activity would result in differences in the absolute levels calcu- 
lated but not in the relative numbers and would therefore not 
influence the correlation value determined. 

The protein quantitative method used eliminates a number 
of potential errors inherent in previous methods for the quan- 
titation of proteins separated by 2DE, such as preferential 
protein staining and bias caused by inequalities in the number 
of radiolabeled residues per protein. Any 2D gel-based method 
of quantitation is complicated by the fact that in some cases the 
translation products of the same mRNA migrated to different 
spots. One major reason is posttranslational modification or 
processing of the protein. Also, artifactual proteolysis during 
cell lysis and sample preparation can lead to multiple resolved 
forms of the protein. In such cases, the protein levels of spots 
coded for by the same mRNA were pooled. In addition, the 
existence of other spots coded for by the same mRNA that 
were not analyzed by mass spectrometry or that were below the 
limit of detection for silver staining cannot be ruled out. How- 
ever, since this study is based on a class of highly expressed 
proteins, the presence of undetected minor spots below silver 
staining sensitivity corresponding to a protein analyzed in the 
study would generally cause a relatively small error in protein 
quantitation. 

Codon bias is a measure of the propensity of an organism to 
selectively utilize certain codons which result in the incorpo- 
ration of the same amino acid residue in a growing polypeptide 
chain. There are 61 possible codons that code for 20 amino 
acids. The larger the codon bias value, the smaller the number 
of codons that are used to encode the protein (19). It is 



thought that codon bias is a measure of protein abundance 
because highly expressed proteins generally have large codon 
bias values (3, 13). 

Nearly all of the most highly expressed proteins had codon 
bias values of greater than 0.8. However, we detected a number 
of genes with high codon bias and relative low protein abun- 
dance (Fig, 7). For example, the caressed gene with both the 
second largest protein and mRNA levels in the study was 
EN02_YEAST (775,000 and 289.1 copies/cell, respectively). 
EN01_YEAST was also present in the gel at much lower 
protein and mRNA levels (44,200 and 0.7 copies/cell, respec- 
tively). The codon bias values for EN02 and ENOl are similar 
(0.96 and 0.93, respectively), but the expression of the two 
genes is differentially regulated. Specifically, ENOl^YEAST is 
glucose repressed (6) and was therefore present in low abun- 
dance under the conditions used. Other genes with large codon 
bias values that were not of high protein abundance in the gel 
include EFTl, TlFl, HXK2, GSPl, EGD2, SHM2, and TALI. 
We conclude that merely determining the codon bias of a gene 
is not sufficient to predict its protein expression level. 

Interestingly, codon bias appears to be an excellent indicator 
of the boundaries of current 2D gel proteome analysis tech- 
nology. There are thousands of genes with expressed mRNA 
and likely expressed protein with codon bias values less than 
0.1 (Fig. 4A). In this study, we detected none of them, and only 
a very small percentage of the genes detected in this study had 
codon bias values between 0.1 and 0.2 (Fig. 4B). Indeed, in 
every examined yeast proteome study (5, 7, 13, 28) where the 
combined total number of identified proteins is 300 to 400, this 
same observation is true. It is expected that for the more 
complex cells of higher eukaiyotic organisms the detection of 
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low-abundance proteins would be even more challenging than 
for yeast. This indicates that highly abundant, long-lived pro- 
teins are overwhelmingly detected in proteome studies. If pro- 
teome analysis is to provide truly meaningful information 
about cellular processes, it must be able to penetrate to the 
level of regulatory proteins, including transcription factors and 
protein kinases. A promising approach is the use of narrow- 
range focusing gels with immobilized pH gradients (IPG) (23). 
This would allow for the loading of significantly more protein 
per pH unit covered and also provide increased resolution of 
proteins with similar electrophoretic mobilities. A standard pH 
gradient in an isoelectric focusing gel covers a 7-pH-unit range 
(pH 3 to 10) over 18 cm. A narrow-range focusing gel might 
expand the range to 0.5 pH units over 18 cm or more. This 
could potentially increase by more than 10-fold the number of 
proteins that can be detected. Clearly, current proteome tech- 
nology is incapable of analyzing low-abundance regulatory pro- 
teins without employing an enrichment method for relatively 
low-abundance proteins. In conclusion, this study examined 
the relationship between yeast protein and message levels and 
revealed that transcript levels provide little predictive value 
with respect to the extent of protein expression. 
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